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Today’s Task for Economists’ 


By WILLIAM VICKREY’ 


I. The Allocation Task of Yesteryear 


Nearly two score years ago, on the occa- 
sion of Columbia’s bicentennial celebration, 
Sir Dennis Robertson gave an address enti- 
tled ““What Do Economists Economize,” the 
burden of which was that, since presumably 
economists are the most expert economiz- 
ers, they should economize the most pre- 
cious thing in the world, namely, love, or 
altruism. This would be done in part by so 
arranging things that in the ordinary con- 
duct of life individual choices made on the 
basis of self-interest in terms of market 
prices would at least be consistent with max- 
imizing social welfare, so that the exercise 
of scarce resources of altruism could be 
concentrated on situations where Adam 
Smith’s unseen hand could not be made to 
serve. To me, one implication of this was 
that economists should see to it that market 
prices correctly reflect the relevant marginal 
social cost of various alternatives. I have 
devoted a major part of my career to the 
promotion of such marginal-cost pricing, but 
thus far with a notable lack of practical 
success outside academia. 

At the time of Robertson’s address, in- 
deed, there was a certain euphoria prevail- 
ing among at least part of the economics 
profession over the prospect of curbing the 
business cycle and maintaining a high level 
of economic activity through Keynesian fis- 
cal policy. Under these circumstances it was 
reasonable to think that the chief remaining 
job of the economist was to assure a 
Pareto-efficient allocation of a given aggre- 
gate of resources. The event, however, 
proved otherwise. The conventional wisdom 


‘Presidential address prepared for the one-hundred 
fifth meeting of the American Economic Association, 
January 6, 1993, Anaheim, CA. 
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of regarding budget deficits as improvident 
prodigality, and government debt as the 
legacy of a craven deferral of burden to the 
future, resumed command. 

One eminent economist is said to have 
remarked, in effect, that it was the function 
of the science of public finance to see to it 
that nothing of importance is ever done or 
left undone merely for financial reasons. 
Alas, the financial reasons have thus far 
carried the day, and we have not had any- 
thing approaching real full employment 
since the Korean War, or indeed in peace- 
time at any time since 1925, if then, at least 
in terms of the Beveridge definition of full 
employment as a situation wherein there 
are at least as many unfilled job openings as 
there are unemployed individuals seeking 
work. 

In > Eisenhower years, the conven- 
tional wisdom held sway in spite of the 
absence of serious contraindications to the 
Keynesian prescription. In the 1960’s, the 
simple Keynesian analysis began to be called 
into question by the emergence of stagfla- 
tion, a phenomenon not contemplated by 
the earlier Keynesian models. A new rela- 
tionship, the Phillips curve, relating the 
evolution of inflation to the level of unem- 
ployment was added to the economists’ 
armamentarium, with its “non-inflation- 
accelerating rate of unemployment” or 
NIARU. 

This NIARU is of course not a fixed 
datum, but varies over time and place ac- 
cording to the sociopolitical ambience, the 
mechanics of the labor market, and the 
vigor of competition. It may have been ris- 
ing over time as a result of the increased 
sophistication and differentiation of prod- 
ucts, real and factitious, giving sellers, as 
the ones most knowledgeable about the 
characteristics of their products and their 
markets, considerable leeway to raise their 
prices without unacceptable loss of sales. 
This process is ultimately held in check only 
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by the presence of underutilized labor and 
other resources. Currently in the United 
States the NIARU appears to be around 
4—6 percent. 

In some quarters this NIARU has even 
been termed the “natural” rate of unem- 
ployment, in one of the most vicious eu- 
phemisms ever coined. Some have even gone 
so far as to define ‘full employment” as 
being the NIARU. But while 5-percent un- 
employment might be barely tolerable if it 
meant that everyone would be taking an 
additional two weeks of vacation every year 
without pay, it is totally unacceptable as a 
social goal when it means unemployment 
rates of 10, 20, or even 40 percent among 
disadvantaged groups, with resulting in- 
creases in poverty, homelessness, poor 
health, drug addiction, and crime. Yet the 
hard political fact is that at such a NIARU 
the great majority of the voting population, 
including most of the politically active up- 
per and middle classes, will have relatively 
little personal experience of severe unem- 
ployment, while nearly everyone will have 
some direct experience of inflation. Many 
seem to feel that if only prices would stop 
rising they would benefit correspondingly by 
having their income go further, giving rela- 
tively little thought to the effect on their 
incomes. Even those with large mortgages 
or other debts, who would actually gain 
from inflation, tend to concur in the notion 
that they suffer from it. It is thus extremely 
difficult to get political support for anti- 
unemployment measures that are perceived 
as involving a threat of inflation, at least 
until unemployment reaches 7 percent or 
more, at which point unemployment be- 
comes a more widespread threat. 

Actually it is the uncertainty as to the 
rate of inflation, and not its level, that does 
the damage. An assured, moderate rate of 
inflation can be adapted to by adjusting 
nominal rates of interest and the terms of 
long-term contracts involving money pay- 
ments. The “menu cost” of changing price 
tags and catalog quotations is probably less 
important than the mental effort required 
of consumers in forming an idea of what an 
appropriate current price is for infrequently 
purchased items, such as furniture or cloth- 
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ing. An inflation rate assured to stay be- 
tween 5 percent and 6 percent, say, might 
even have advantages. Monetary policy 
would be more powerful in stemming a 
downturn in that very low and even negative 
real rates of interest would become feasible 
as a stimulus to investment. It might in 
principle be easier to keep inflation within a 
l-percent range between 5 percent and 
6 percent, than to keep it within a 2-percent 
range between — 1 percent and +1 percent, 
given the smaller real value of non- 
interest-bearing moneys in circulation, even 
allowing for the superior political focusing 
power of a target of 0 percent as compared 
to one of 5.5 percent. 

The base of the income tax would be 
broadened also, making it possible to have a 
tax that is more progressive and more pro- 
ductive of revenue with lower marginal rates 
and less of a distortionary effect. A tax 
based on nominal accrued income would in 
effect be a tax on a base consisting of real 
income plus a percentage of net worth. 
While this is not what is meant by an ideo- 
logically pure income tax, in terms of its 
practical effects it can be deemed a superior 
tax. 

It is the possibility of substantial changes 
in the rate of inflation, either up or down, 
that does the damage. Such changes involve 
a disappointment of expectations and a re- 
distribution of wealth and income derived 
from a given national product that is capri- 
cious and often inequitable, but it does not 
of itself substantially reduce the amount to 
be distributed. Unemployment, on the other 
hand, directly and definitely reduces the 
total product to be distributed. Unantici- 
pated changes in the rate of inflation, up or 
down, may be considered to be a form of 
legitimized embezzlement, whereas unem- 
ployment is vandalism. 

Nevertheless, the stance of the politico- 
financial establishment is still to look at the 
bottom line as the ultimate reality, whether 
of the corporation or the national budget, 
and since money is the measure of all good 
and evil in this kind of calculus, anything 
that impugns the value of money is viewed 
as a kind of sacrilege reinforced by a lurking 
fear of starting down a slippery slope to 
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hyperinflation. We find the Federal Reserve 
System poised to slam on the brakes at the 
first sign of a resurgence of inflation, a 
posture not calculated to inspire investment 
in durable capital. 

On the political side, we see the House 
voting by a substantial majority in favor of a 
constitutional amendment to require a bal- 
anced budget, fortunately falling short of 
the required two-thirds. This was done in 
spite of the fact that the nominal budget as 
currently computed is not a valid measure 
of any significant economic quantity. The 
nominal deficit would be reduced by selling 
the Pentagon to a life insurance company 
subject to a long-term lease-back and repur- 
chase option; this at least would do no 
harm, unlike the sale of natural resources to 
private exploiters which would actually de- 
crease the real heritage handed down to the 
future, on the pretext of reducing the trans- 
fer requirements embodied in the national 
debt. 


II. Recycling Savings Through Public 
Capital Formation 


From a classical standpoint, of course, 
the difficulty is that no account is taken 
of the distinction between transactions on 
current account and on capital account. If 
AT&T, General Motors, and households 
had been constrained to operate under the 
restrictions of the proposed balanced-budget 
amendment, we would now have far fewer 
telephones, automobiles, and houses. A 
capital budget, with a vast expansion of 
government capital outlays on roads, 
bridges, research, education, and the like, 
financed by borrowing, might go a consider- 
able way toward improving the unemploy- 
ment situation. But there is no assurance 
that it could do the whole job. 


III. Eliminating the Corporation Income Tax 


Other classical approaches to improving 
the unemployment situation exist but have 
their own political opposition and in any 
case are too weak to make much of a dent 
in a very large need. One such measure 
would be the abolition of the corporation 
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income tax, which is by far the most serious 
hurdle in the way of private capital forma- 
tion of a kind requiring equity funding. Un- 
like the capital-gains tax, the corporate in- 
come tax is a tax largely above or before the 
market, requiring a rate of return on invest- 
ment sufficient to cover the corporation tax 
and leave a rate of return after tax compa- 
rable to other investments, whereas the 
capital-gains tax operates largely as a reduc- 
tion in the return to the investor after or 
below the market, comparable to the reduc- 
tion of net income to the taxpayer resulting 
from the personal income tax on other in- 
come. In addition, the corporation tax 
causes inefficient allocation of investment 
between equity-type and loan-type invest- 
ments; it encourages thin equity and result- 
ing bankruptcies and reorganizations, and it 
lubricates takeovers and mergers of dubious 
intrinsic merit. 

Reduction of the tax on capital gains, on 
the other hand, might actually depress eco- 
nomic activity if the additional savings out 
of the tax reduction were to exceed the 
additional capital formation induced. This is 
the more likely in that most of the tax 
reduction is likely to be saved immediately, 
whereas the inducement to capital forma- 
tion is in terms of a tax reduction in a 
relatively remote future, subject to legisla- 
tive vicissitudes. At best, special treatment 
of capital gains greatly increases the com- 
plexity of the tax law and diverts investment 
flows from their most efficient use. There is 
nothing to indicate that investments likely 
to yield returns in forms defined by the tax 
code as capital gains will have any superior 
social value: gains from land speculation, in 
particular, add nothing to the real availabil- 
ity of resources. 

As for the corporate income tax, in spite 
of its many defects from the standpoint of 
economic efficiency, it has enormous politi- 
cal popularity due to the fact that nearly 
everyone thinks that it is paid by someone 
else. Indeed economists have differed widely 
in their assignment of the “burden” of the 
tax, owing to a failure to specify, or even to 
consider, the macroeconomic policy changes 
necessarily involved in a change in the tax. 
Unlike most other taxes, the corporation tax 
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inflicts a double whammy on the economy 
in that it both extracts income from the 
stream of purchasing power and reduces the 
recycling of savings through investment. If 
imposed on a revenue-neutral basis it causes 
unemployment, while if a budgetary adjust- 
ment is made to maintain employment con- 
stant, its burden can be thought of as falling 
on future wage earners, who will have less 
capital with which to work. 

Problems of the deferral of income 
through undistributed profits, as well as the 
deferral of taxation to the time of realiza- 
tion of capital gains, would ideally be met 
by putting the personal income tax on a 
cumulative basis, along lines I developed 
while working with Carl Shoup in 1938, 
whereby the deferral of the reporting of 
income, by whatever means, merely involves 
the borrowing of the deferred tax at a suit- 
able rate of interest. About two-thirds of 
the internal revenue code would become 
redundant, with the possible exception of 
the need to deal with the international jet 
set and revolving-door marriages; large 
numbers of tax techies would be able to 
apply themselves to more productive em- 
ployment. 

Failing this, an approximation to a level 
playing field might be had by imposing a 
small annual tax on the accumulated undis- 
tributed surplus of corporations, roughly 
equal to the interest on the stockholders’ 
postponed individual income tax. Similarly, 
there should be a surcharge on realized 
capital gains, proportionate to the length of 
time held, to offset the gain from the defer- 
ral of the tax. 

If there is nevertheless a need to cater to 
a political demand for something that can 
be labeled a corporation tax, this might be 
satisfied by levying a corporation tax on 
dividends, interest, and retained earnings at 
a rate corresponding to the first-bracket rate 
of the individual income tax and exempting 
such interest and dividends from this “‘nor- 
mal” rate, going back to the pre-1934 prac- 
tice of dividing the income tax into a normal 
tax and a progressive surtax. To even things 
up neatly, normal tax paid on other forms of 
income should be deductible in computing 
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the base for the progressive surtax paid by a 
minority of taxpayers. It would still be ap- 
propriate to have an undistributed surplus 
tax to correspond to this surtax. 


IV. Tax-Exempt Bonds 


Another measure that might slightly im- 
prove investment allocation would be to re- 
place the exemption of interest on state and 
local bonds by a taxable tax credit at a rate 
that would maintain the market value of the 
bonds. Low-bracket taxpayers would be lit- 
tle affected, while the entire loss of revenue 
to th. Treasury would accrue as a subsidy to 
the issuers. Upper-bracket taxpayers would 
no longer have an incentive to invest in such 
bonds rather than in riskier investments 
more suitable to their status. 


V. Taxing Imputed Income 


A more important but politically more 
difficult measure would be to require the 
inclusion in taxable income of the rental 
value of owner-occupied residences. This 
would not only improve the equity and pro- 
gressivity of the income tax but go a sub- 
stantial way toward making more units 
available for rental and, to a modest extent, 
promoting the construction of additional 
affordable rental housing and abating the 
problem of homelessness. A similar case 
can be made for including in the income tax 
base a net rental value of nonbusiness auto- 
mobiles (equal to interest on the market 
value of the car), in this case reducing the 
discrimination against the use of public 
transit. 


VI. Shifting Property Taxes from 
Improvements to Land 


A measure that could provide a powerful 
stimulus to investment in property improve- 
ments would be to replace part or all of the 
property tax by a tax on land value only, a 
proposal that can be traced all the way back 
to Francois Quesnay and the French phys- 
iocrats but which is more recently associ- 
ated with the name of Henry George. This 
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would remove the very serious deterrent 
effect of the property tax on improvements. 
Unfortunately from the standpoint of a na- 
tional employment policy, this tax is largely 
levied by local governments, which are often 
constrained by constitutional provisions or 
state laws. Nevertheless some means of 
bringing pressure to bear on these govern- 
ments to make this change might be found. 
Some Pennsylvania governments are already 
doing this. When levied for municipal pur- 
poses, it might be appropriate to exempt 
from the tax a flat amount per square foot 
as representing the value of circumambient 
agricultural land for which the urban gov- 
ernment can claim no credit; this would also 
mitigate discriminations at jurisdictional 
boundaries. 

It is perhaps worth noting that the signif- 
icance of a government debt would be dras- 
tically different in a community relying ex- 
clusively on a land-value tax. Such a debt 
would in effect be a collective mortgage on 
the land, especially if it can be assumed that 
land values in the community will vary pro- 
portionately over time. Since the interest on 
the community debt will generally be lower 
than interest charged on individual mort- 
gages, it can be in the general interest of all 
the taxpayers of the community for the gov- 
ernment to borrow as much as the market 
will take, even to finance current outlays, 
provided a suitable margin is left to deal 
with emergencies. On the other hand such 
debt financing performs no recycling of sav- 
ings, there is no room for Keynesian fiscal 
policy, and Ricardian equivalence is in full 
sway. This does not detract, however, from 
the powerful stimulating effect of a reduc- 
tion in the tax on improvements. 


VII. Limitations of ““Supply-Side’”’ Measures 


Under current conditions, however, such 
“supply-side” measures designed to operate 
by reducing the cost of capital are likely to 
be severely limited in their effect as long as 
nearly all types of capital facilities are idle 
or underutilized. Very little “widening” in- 
vestment is likely to take place as long as 
there is excess capacity in place. At most, 
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some “deepening” investment in new prod- 
ucts or technologies may take place, or there 
may be corners of the economy where rela- 
tively rapid growth has kept capacity fully 
utilized. Even in such cases, investment in 
capital facilities may depend more on ap- 
praisals of an uncertain market for the 
product than on the cost of capital. 

This is likely to be true not only of tax 
policy but even more of monetary policy. In 
any attempt to emerge from present rates of 
unemployment even only down to the 
NIARU within any reasonable time period, 
monetary policy is likely to prove a weak 
reed, sometimes aptly described as pushing 
on a string. The main difficulty is that mon- 
etary policy bears primarily on short-term 
interest rates and credit availability and in 
its usual practice does not directly control 
long-term rates, which are the important 
rates for most decisions involving real 
durable capital formation. The posture of 
the Federal Reserve System in holding itself 
ready to slam on the brakes at the first sign 
of resurgent inflation is poorly adapted to 
bringing long-term rates down. It does not 
appear that the Fed has either the will or 
the resources to do enough about long-term 
rates to do very much to increase capital 
formation, especially when idle and under- 
used capacity pervades much of the econ- 
omy. 


VIII. Savings Recycling by Government 


This brings us inevitably around to fiscal 
policy. Here it is necessary to stop thinking 
of the conventional nominal budget deficit, 
or even of a current-account deficit in a 
budget drawn up in terms of distinguishing 
capital and current-account items, and to 
start thinking of fiscal policy in terms of its 
role in recycling savings, in excess of what is 
recycled by private investment, into the 
stream of purchasing power. The conven- 
tional wisdom seems to argue that increased 
employment requires that the economy 
grow, growth requires investment, and in- 
vestment requires savings; therefore let’s 
encourage saving through IRA’s, tax expen- 
diture rather than income, and tighten our 
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belts to restore the economy to its normal 
state of health. 

It doesn’t work that way. Savings are not 
like a sack of potatoes which if not sold at 
the current price will stay on hand and put 
a downward pressure on the price until sold. 
Savings not immediately taken up to create 
capital simply vanish in reduced income, 
without even exerting a downward pressure 
on interest rates. If I yield to the allure- 
ments of tax concessions to IRA’s to the 
point of not having my hair cut, this puts $8 
more in my bank account, but $8 less in the 
barber’s account; there is nothing that makes 
it any easier for anyone to obtain funds with 
which to create capital, nor anything that 
makes the prospect more attractive. As 
Gertrude Stein remarked, “the money is 
always there, it’s the pockets that keep 
changing.” If the barber reacts by curtaling 
his consumption, this further reduces na- 
tional income and saving. I may succeed in 
my attempt to save, but only by reducing the 
saving of others by even more. Savings are 
an extrcmely perishable entity. Say’s law 
fails as soon as part of the income gener- 
ated in the process of producing the supply 
is shunted off into savings that fail to get 
converted into new capital goods. 

On the other hand, if some genius invents 
a new product or process and obtains a 
credit or borrows the funds needed to fi- 
nance the capital involved in its production, 
this added real wealth is, ipso facto, some- 
one’s saving. Instead of Say’s law, we have 
“capital formation creates its own saving.” 
Similarly, if the government borrows funds 
created by credit expansion and recycles 
them into purchasing power through out- 
lays, whether on current or capital account, 
this creates both income out of which addi- 
tional savings will be attempted and de- 
mand that may induce the private invest- 
ment to meet it. 

Not all deficit financing, however, results 
in recycling of savings, whether measured by 
the current capriciously defined nominal 
deficit or by a more rational definition in- 
volving accounting for government assets. 
We have seen that in a community relying 
exclusively on a land tax, recycling does not 
take place. Nor would the sale of the Pen- 
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tagon, or the purchase of an office building 
currently being rented by the government, 
offset by bond transactions, involve any 
change in the level of recycling. Government 
recycling is in principle the excess of those 
government outlays that are regarded by 
their recipients as income over those govern- 
ment receipts that are regarded by their 
payors as reductions in their disposable in- 
come. Even this is subject to some caveats: if 
government investment in a power plant, for 
example, substitutes for investment that 
would otherwise have been made by private 
enterprise, there is no net recycling. 

On the whole, however, recycling tends to 
vary in rough correlation with the nominal 
deficit, and the strength of the notion in the 
minds of the public and their representa- 
tives that deficits are bad and that the 
“budget” should be balanced may make it 
difficult to achieve an adequate level of 
recycling. Some help in this respect may be 
obtained by going to a capital budget sys- 
tem, in which balance would be sought only 
for the current-account part of the budget, 
borrowing for the capital account being jus- 
tified by comparisons with corresponding 
private practices and by the thought that 
future generations being burdened with the 
debt would also reap benefits from the capi- 
tal passed on to them. While this may con- 
strain choice away from what rational voters 
would have chosen as the optimal level of 
government capital formation, there would 
seem to be sufficient scope for government 
capital investment to provide sufficient recy- 
cling to bring about full employment, partic- 
ularly if investments in education, research, 
space exploration, and the like are consid- 
ered eligible for treatment as capital invest- 
ment. Some of these projects. even if they 
would not stand scrutiny aside from their 
function in justifying income recycling, may 
nevertheless have the same kind of justifi- 
cation as the building of the Egyptian pyra- 
mids had for Keynes. On general welfare 
grounds, one might well prefer recycling in 
terms of borrowing to finance health care to 
borrowing to finance space stations, but if 
borrowing for health care is deemed to cre- 
ate an ideologically sinful current-account 
deficit, space stations it will have to be. 


VOL. 83 NO. 1 


IX. The Need for Direct Inflation Control 


Long before the economy reaches a really 
satisfactory level of full employment, how- 
ever, as employment gets to the NIARU 
level, and inflation threatens to accelerate, 
the Fed is likely to try to slam on the 
brakes, and demands for a more stringent 
budget balancing and cutback of ‘“‘govern- 
ment waste” are likely to be heard in the 
halls of Congress. To get anywhere near a 
satisfactory level of unemployment, some 
method of dealing with inflation will have to 
be devised. We are short of tools. 

In effect, the economy can be thought of 
as having three major parameters that we 
would like to control: the level of employ- 
ment of human and other resources, the 
price level, and the division of the resulting 
total product between provision for current 
wants and investment in growth and the 
future. At the same time, we have only two 
major policy tools: monetary and fiscal pol- 
icy. In an era when inflation was not a 
threat, one could think of these two tools as 
controlling the level of employment and the 
rate of growth, with low interest rates com- 
bined with a deficit or surplus sufficient to 
maintain full employment leading to high 
investment and growth, and conversely. 
However, with a need to control inflation as 
well, relying on only two dimensions of con- 
trol is like trying to fly an airplane without 
ailerons, which were the third dimension of 
control that was the key to the success of 
the Wright brothers. A new tool is needed. 

Over the past three decades a number of 
proposals for direct control of inflation have 
been made, but none has achieved general 
acceptance. Wartime control of specific 
prices, accompanied by rationing, was ac- 
cepted as an emergency measure and 
worked in part because of patriotic willing- 
ness to conform and in part because, being 
temporary, past prices could be continued 
without becoming absurd. As a _ perma- 
nent scheme this is probably unworkable 
and certainly unacceptable. More recent 
schemes have involved tax incentives of 
various kinds to provide a countervailing 
downward pressure against the inherent 
inflationary tendency of an_ imperfectly 
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competitive system. Such schemes have 
generally suffered from difficulties in mea- 
suring price changes at an individual-firm 
level, capriciousness of results when tied to 
such taxes as the corporation income tax, 
and possible time lags in adjusting the 
strength of the incentives to changing cir- 
cumstances. 


X. Market-Based Inflation-Control Plans 


A few years ago David Colander came to 
visit me and reported on a proposal by 
Abba Lerner for a market in rights to raise 
prices. Those wishing to raise their prices 
would be required to purchase the right 
from those prepared to lower their prices, 
thus assuring a constant overall price level. 
While this neatly circumvents the problem 
of adjusting the strength of incentive to 
changing inflationary pressures, the prob- 
lem remains of how to measure price 
changes in the face of quality changes, new 
products, and variations in the terms of sale 
such as delivery, reliability, service, credit 
terms, tie-in sales, and the like. 

More pregnant was the question of how 
to deal with cases in which prices paid to 
suppliers have risen. A somewhat similar 
problem arises with gross receipts taxes, 
which discriminate in favor of vertically in- 
tegrated operations and against situations in 
which the product passes through several 
hands on the way to the market. In Europe 
this problem has been solved by shifting 
from gross receipts taxes and retail sales 
taxes to value-added taxes, which immedi- 
ately suggests that instead of a market in 
rights to raise prices we have a market in 
rights to value added. 


XI. Control with Marketable Gross 
Markup Warrants 


For semantic reasons I have chosen to 
speak in terms of “gross markups” rather 
than value added, as being more suggestive 
of something to be restrained rather than 
promoted. In principle, gross markups sim- 
ply refers to the excess of sales revenue over 
amounts paid for nonprime inputs. In oper- 
ation, warrants for gross markups for a 
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prospective accounting period would be is- 
sued to each firm on the basis of the gross 
markups for a corresponding preceding pe- 
riod, plus or minus adjustments for changes 
in prime inputs such as labor and invested 
capital. These warrants would be issued in 
sufficient total face value to correspond to 
the value at a desired price level of the 
output expected to be produced by the in- 
puts against which the warrants were issued. 
They would be freely tradable for cash in a 
competitive market, and if at the end of the 
accounting period a firm is found to have 
retained or acquired fewer warrants than 
the actual amount of its gross markups for 
the period, a penalty tax would be assessed. 
This tax would not be a substantial source 
of revenue, but would serve merely as an 
enforcement device. It could be set at a 
level fairly certain to be higher than the 
market price of the warrants. 

Adjustment of the warrant issue for 
changes in investment could be made simply 
on the basis of a uniform percentage of 
such change. Adjustment for changes in em- 
ployment is somewhat more difficult: a flat 
amount per employee or man-hour takes 
too little account of variations in qualifica- 
tions, while to allow adjustments equal to 
payrolls would run a danger of allowing 
inflationary wage increases. Some formula 
such as a percentage of payrolls plus a flat 
amount per employee might be satisfactory; 
such a formula would involve a certain bias 
in favor of the employment of low-skill la- 
bor, which may be considered desirable in 
view of the fact that this is where the unem- 
ployment problem is most serious. 

Administration would seem to pose no 
insurmountable problems. Determination of 
gross markups is essentially no different than 
the assessment of a value-added tax such as 
is widespread in Europe. Adjustment for 
investment can be made on the basis of 
accounts already needed for income-tax 
purposes, while adjustments for employ- 
ment can be related to the social-security 
records. Some special methods may have to 
be developed for dealing with the self- 
employed and very small firms, and possibly 
some classes of firms could be excluded 
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from the scheme, as is sometimes done with 
the value-added tax. 


XII. Prospects for Rapidly Reaching 
Genuine Full Employment 


With such a scheme in place, what can we 
plan for in terms of getting from where we 
are to full employment? Currently unem- 
ployment is reported as about 7.5 percent, 
and full employment can be reckoned at 
about 1.5 percent, giving a slack to be made 
up of 6 percent. Using Okun’s ratio of per- 
centage change in GNP to _ percentage 
change in reported unemployment of 2.5, 
we have a slack of 15 percent to be made 
up. If this slack can be taken up within two 
years, this will be 7.5 percent per year; if to 
this we add 2.5 percent for growth in the 
labor force and in productivity, we get 10- 
percent annual growth in GNP over two 
years. After two years, we hit the full- 
employment ceiling, and growth thereafter 
will be limited to the labor force and pro- 
ductivity factor, possibly between 2 percent 
and 4 percent. 

Is public finance up to the job of reaching 
the goals thus defined in terms of the limits 
of our real resources? Possibly, but it re- 
quires breaking new ground. One would 
have to begin with increasing government 
recycling as rapidly as possible by 8—10 per- 
cent of GNP in order to inaugurate the 
10-percent growth rate. How rapidly this 
could be done would of course depend on 
the political and legislative ambience. From 
some points of view the fastest and easiest 
way to do this is by tax cuts. Unfortunately, 
if tax cuts are temporary they tend to be 
viewed as windfalls to be saved rather than 
spent, so that only part of the tax cuts are 
effectively recycled. Alternatively, if not an- 
nounced as temporary, tax cuts tend to cre- 
ate a resistance to later tax increases called 
for by full-employment conditions and large 
debt-service requirements. This is especially 
threatening in the present context of politi- 
cal campaigning on the basis of promises of 
no new taxes. Perhaps the best tax cut would 
be a cut in the payroll taxes, as promising 
the maximum proportion of recycling, if this 
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can be done in the face of outcries that this 
would be jeopardizing the financial sound- 
ness of the social-security system. 

Outlays on actual programs, on the other 
hand, are somewhat harder to start and 
stop rapidly. There is also the need not to 
get too far ahead of the effective operation 
of whatever anti-inflation program is put in 
place, whether the program of gross markup 
warrants proposed above or some other, lest 
anticipatory speculation and inflation get 
out of hand. The exact program for the 
start-up period will require careful study. 

What happens after the first few months 
will depend to a large extent on what Keynes 
called the “animal spirits” of the financial 
community. At one extreme there could be 
such horror and alarm at the violation of 
the conventional wisdom concerning the 
sinfulness of deficits as to produce a 


widespread hibernation and flight to foreign 
shores. More likely, once the financial com- 
munity has become convinced of the seri- 
ousness of the administration’s purpose to 
bring about full employment, and once it is 
anticipated that demand will shortly use up 


the spare capacity of existing productive 
facilities, private capital formation may pick 
up to the point of absorbing and recycling 
individual savings sufficiently so that gov- 
ernment recycling may for the time being 
become unnecessary. At the same time, gov- 
ernment revenues from increased GNP will 
increase and outlays for unemployment in- 
surance and welfare will decrease. Also, 
there may be a need to shut down those 
governmental programs that compete for 
real resources with private capital forma- 
tion, in order to avoid a real “crowding out” 
(as contrasted with the financial crowding 
out alleged to occur as a result of govern- 
ment borrowing associated with a tax cut). 
As a result, a brief period of budget balance 
or even of surplus may become appropriate. 

As the economy hits the ceiling of full 
employment, however, still another transi- 
tion becomes necessary. For a while capital 
formation may continue on its momentum, 
recycling savings but producing excess ca- 
pacity that either cannot find labor with 
which to operate or cannot find markets in 
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which to sell its product. Within a short 
time after hitting the full-employment ceil- 
ing, capital formation will have to drop from 
that appropriate to a 10-percent growth rate 
to that suited to a far slower growth rate. At 
this point attempted savings may again ex- 
ceed what can be absorbed by private capi- 
tal formation, even at very low rates of 
interest. Other ways to recycle the excess 
will again become necessary, one of which 
will be renewed government recycling. 


XIII. Long-Term Excess of Demand Saving 
over Private Investment 


There is, indeed, no principle of eco- 
nomics that says that there will always be a 
feasible rate of interest that will equate 
desired savings and private capital-forma- 
tion under conditions of steady full employ- 
ment. Current trends seem to be such as to 
make such a possibility unlikely. One factor 
has been a spate of capital-saving innova- 
tions and practices. Fiber optics, when fully 
utilized, costs less per unit of service than 
previous technologies by orders of magni- 
tude, leaving ductways planned for copper 
conductors forever surplus; electronic ex- 
changes occupy a fraction of the space 
formerly required by equivalent electro- 
mechanical exchanges; just-in-time practices 
reduce investment in inventory; improved 
communications enable more freight to be 
carried on a single track line with sidings 
than was formerly carried by a full two-track 
line; a man assembling electronic gear with 
a soldering iron uses far less capital than 
the man in the pulpit of a rolling mill, and 
service industries generally use less capital 
per employee than manufacturing, mining, 
or transportation. 

Moreover, before gross investment can 
begin to recycle private savings, it must first 
recycle funds set aside in depreciation, 
amortization, depletion, and obsolescence 
charges, while rapid obsolescence due to 
accelerating technological progress makes 
capital formation relatively insensitive to 
changes in interest rates. Very low or nega- 
tive interest rates may stimulate investment 
in nondepreciating assets such as land, but 
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even this is limited by the possibility that 
speculative bubbles may burst, and in any 
case relatively little recycling is produced 
thereby, except to the extent that the en- 
hanced asset values cause owners to feel 
wealthier and spend more. 

On the savings side, increased longevity 
and the high cost of old-age illness lead to 
increased savings through funded pensions 
and other provisions for retirement. For this 
purpose, the lower the rate of interest, the 
greater is the amount of current savings that 
must be put aside to provide a given level of 
retirement security. More recently the in- 
creased concentration of income among the 
very wealthy, who have a high propensity to 
save, not so much for eventual consumption 
but largely to accumulate chips with which 
to play financial games and exercise eco- 
nomic power, has further added to the sav- 
ings-recycling problem. Some recycling may 
take place through investment abroad, re- 
flected in a positive trade balance and the 
production of goods for export, though it is 
uncertain how far this can be carried in the 
face of political instability, the danger of 
creating repayment problems, and the resis- 
tance of foreign governments that do not 
have an effective full-employment policy of 
their own to our exporting our unemploy- 
ment to them in this way. 

On balance, it may prove impossible, for 
the foreseeable future, to maintain a steady 
state of genuinely full employment without 
a substantial amount of government recy- 
cling of savings, a chronic budget deficit, 
and a long-term increasing trend in the na- 
tional debt, however distasteful this may be 
to those ideologically addicted to a bal- 
anced budget. It may even prove necessary 
for the debt to grow at a rate faster than the 
growth of GNP. The burden of servicing 
this debt might be kept within bounds by 
reducing real interest rates, close to zero if 
need be, though this might imply a higher 
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level of private investment than would be 
chosen on its own merits. Even contemplat- 
ing such prospects calls for a significant 
expansion in our range of habitual thought. 


XIV. The Task Before Us 


This, then, is the challenge I lay before 
the economics profession. There is no rea- 
son inherent in the real resources available 
to us why we cannot move rapidly within 
the next two or three years to a state of 
genuinely full employment and then con- 
tinue indefinitely at that level. We would 
then enjoy a major reduction in the ills of 
poverty, homelessness, sickness, and crime 
that this would entail. We might also see 
less resistance to reductions in military ex- 
penditure, to liberalization of trade and im- 
migration policy, and to conservation and 
environmental protection programs. 

I lay before you a plan I believe can 
accomplish this. It involves government re- 
cycling of excess savings plus a method of 
keeping inflation under control. I believe it 
can do the job while preserving the essen- 
tials of a free-market system. There may be 
some details to be worked out, but I am 
confident that the basic concept is sound 
and workable. 

We simply cannot carry on as we have 
been doing without falling apart as a com- 
munity and losing what is left of our status 
of world leadership. If you don’t think that 
something like this can be made to work, 
then it is up to us to get together to find 
something that will. Otherwise, if we con- 
tinue to tie our hands with financial shibbo- 
leths and models that tacitly assume a fixed 
total of resource utilization, we are no bet- 
ter than the feckless castaway whose contri- 
bution to the solution of the problem of 
dealing with cases of canned goods was “‘let’s 
just assume we have a can-opener.” 


Trigger Points and Budget Cuts: 
Explaining the Effects of Fiscal Austerity 


By GiusepPpE BERTOLA AND ALLAN DRAZEN* 


We propose and solve an optimizing model which explains counterintuitive 
effects of fiscal policy in terms of expectations. If government spending follows 
an upward-trending stochastic process which the public believes may fall sharply 
when it reaches specific “trigger” points, then optimizing consumption behavior 
and simple budget-constraint arithmetic imply a nonlinear relationship between 


private consumption and government spending. 


This theoretical relation is 


consistent with the experience of several countries. (JEL E62, E21) 


Textbook aggregate-demand models sug- 
gest simple relationships between the gov- 
ernment budget and economic activity: a cut 
in the government deficit, for example, de- 
pressing consumption and output. Though 
such models heavily influence the design of 
stabilization policies, their micro founda- 
tions are unclear, and their sharp predic- 
tions are not always consistent with reality. 
In many countries, large cuts in government 
spending carried out as part of stabilization 
programs have led to expansions rather than 
contractions in economic activity. 

The absence of simple relationships be- 
tween economic variables is often explained 
theoretically by the importance of expecta- 
tions of future economic variables, including 
variables reflecting government policy. The 
idea that expectations of future policy 
changes are crucial in understanding seem- 
ingly counterintuitive macroeconomic dy- 
namics has been explored before. It formed 
the basis of the now classic “Some Un- 
pleasant Monetarist Arithmetic’ (Thomas 
J. Sargent and Neil Wallace, 1981). More 
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recently, Drazen and Elhanan Helpman 
considered the effects of future anticipated 
fiscal changes meant to balance the budget 
on current inflation in a closed economy 
(1990) and the current account in an open 
economy (1987). Their results indicated that, 
depending on what sort of policy change 
was anticipated, any correlation between in- 
flation (or the current account in an open 
economy) and changes in the budget deficit 
could be observed. 

In this paper we argue that expectations 
about the discrete character of future fiscal 
adjustments can help explain the effects of 
current fiscal policy. The paper develops a 
model in which significant government 
spending cuts take place only when the ra- 
tio of government spending to output hits a 
trigger point, so that sharp policy changes 
occur only infrequently. We use techniques 
developed in the exchange-rate literature to 
study such discrete policy interventions. The 
existence of triggers in a neoclassical model 
implies a nonlinear relation between the 
consumption-to-output ratio and the gov- 
ernment-spending-to-output ratio. Both the 
assumptions about the character of fiscal 
interventions and the implications for the 
ratio of consumption to output find support 
in the joint behavior of consumption and 
government spending observed for several 
countries. We will look in detail at Den- 
mark and Ireland, where a pattern typical 
of the relation between private and govern- 
ment consumption in several countries 
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emerges quite strongly: when government 
spending is a small fraction of GDP, private 
and government consumption (both relative 
to GDP) exhibit an inverse relationship, 
which flattens out as the ratio of govern- 
ment spending to output increases. 

The key characteristic of the expectations 
view of fiscal policy is that nonstandard 
effects of fiscal policy are explained by the 
role of current policy in shaping expecta- 
tions of future policy changes. A policy in- 
novation that would be contractionary in a 
static model may be expansionary if it in- 
duces sufficiently strong expectations of 
future policy changes in the opposite direc- 
tion. If a cut in government spending in- 
duces expectations that future spending and 
therefore taxes will be significantly lower, it 
may induce an expansion in current private 
spending. Taken one step further, the effect 
of increases in government spending may be 
expansionary if private agents’ behavior is 
based on the expectation that high levels of 
spending are unsustainable and will soon be 
cut. 

The plan of the paper is as follows. Sec- 
tion I reviews the Danish and Irish experi- 
ences. In Section II we present a very sim- 
ple reference model in which output is 
constant and government spending follows a 
Brownian-motion process with positive drift 
subject to possible discrete adjustment: 
these simple assumptions, which are sug- 
gested by the Danish and Irish experiences, 
make it possible to obtain a closed-form 
solution and to illustrate the general phe- 
nomena we focus on in this paper. In Sec- 
tion III we propose a solution method for 
models of stochastic policy changes, and we 
show that the simple model implies a non- 
linear relation between the levei of con- 
sumption chosen by optimizing agents and 
the level of government consumption. We 
discuss the model’s implications for con- 
sumption behavior and the current account 
in Section IV. In Section V we discuss the 
match of the theory to the qualitative fea- 
tures of the experiences of Denmark, Ire- 
land, and other countries and consider how 
the simple model may be extended to better 
match observed phenomena. The final sec- 
tion contains concluding comments. 
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I. Two Examples of Fiscal Stabilizations 


Numerous countries undertook fiscal aus- 
terity programs in the 1980’s. Denmark and 
Ireland enacted stabilization programs 
marked by especially sharp cuts in govern- 
ment expenditures during this period. We 
begin with a very quick overview of these 
episodes, based on the excellent analysis by 
Francesco Giavazzi and Marco Pagano 
(1990). 

In the early 1980’s the Danish primary 
government deficit was large relative to 
GDP, and public debt was growing rapidly. 
In the autumn of 1982 concerns about the 
sustainability of the fiscal path led to a 
decision to undertake a sharp fiscal contrac- 
tion beginning in late 1982. In the four 
years that followed, the full-employment 
primary deficit fell by 10 percent of GDP. 
The stabilization was expansionary: the con- 
sumption-to-GDP ratio rose by several per- 
centage points during 1982-1986, as shown 
in Figure 1, where we plot the time paths of 
private consumption c, government con- 
sumption g, and current account x (all rela- 
tive to GDP).! 

In the early 1980’s Ireland was in an even 
worse position. Giavazzi and Pagano (1990) 
report a large budget deficit and total na- 
tional debt equal to 87 percent of GDP. 
The first attempt to stabilize was under- 
taken in 1982. By 1984, the (full-employ- 
ment) primary budget deficit was reduced 
by 7 percent of GDP, largely via increased 
taxes. The fiscal contraction had the text- 
book Keynesian effect, with private con- 
sumption falling sharply. A second attempt 
at stabilization was made in 1987, this time 
relying on large cuts in government con- 
sumption. As in the Danish case, this fiscal 
retrenchment was quite sharp, with the 
full-employment budget deficit falling by 
another 7 percent of GDP between 1987 
and 1989. A further similarity to Denmark 
was the expansionary effect of the fiscal 
change. The data are presented in Figure 2. 


'The data are from the International Monetary Fund 
International Financial Statistics tape: lines 96f, 91f, 90c 
minus 98c, divided by line 99b. 
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In both countries, we see a similar rela- 
tion between the private consumption- 
to-output ratio c and the government 
consumption-to-output ratio g in the 1957- 
1989 period, displayed in Figures 3 and 
4 for Denmark and Ireland, respectively. 
(A similar relation emerges if we use gov- 
ernment expenditure inclusive of transfers 
and debt service instead of government con- 
sumption.) From 1957 until the early to mid 
1970’s we see an inverse relation between c 
and g in both countries; from the early 
1970’s until the early 1980’s c remains flat 
even though g rises significantly; finally, 
from 1982-1983 onward there is a signifi- 
cant drop in g with little change in c. Note 
also that the tax-based stabilization of 1982 
in Ireland is associated with a sharp drop in 
c aS g remains constant. The figures sug- 
gest, first, that fiscal stabilization takes the 
form of a sharp retrenchment, following a 
long upward drift in the ratio of government 
spending to output; and second, that its 
effects on private consumption decisions are 
not as simple as textbook models would 
suggest. 


II. A Simple Model 


We consider a nonmonetary small open 
economy facing a perfect world capital mar- 
ket. Let the world interest rate be r and let 
this also be the consumer’s discount rate. 
Suppose that all goods are tradable, that 
output of tradables is fixed, and that the 
government finances its purchases of trad- 
able goods by lump-sum taxes. (We discuss 
below the implications of assuming instead 
that part of government expenditure falls on 
nontraded goods, or that taxation is distor- 
tionary.) Government spending is assumed 
to follow a known process, which we will 
specify below. Here, we will relate current 
consumption of the representative individ- 
ual to the expected present discounted value 
of future government spending. 

The individual’s choice problem may be 
written as 


(1) max dz 


t 
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subject to 


(2) | (C, +7, 


where {C.} and {r,} are, respectively, the 
consumption and taxation processes; Y is 
(constant) income; A, is total private assets, 
including claims on foreigners and on the 
government; and E,{-} denotes expectation 
conditional on the relevant information 
available at time f¢. 

If government expenditures are stochas- 
tic, so are taxes via the government’s budget 
constraint. The first-order condition for the 
problem in (1) and (2) is 


(3) Efu'(C,)}=A, 


where A, is the multiplier on the integral 
budget constraint. If utility is quadratic in 
C, the first-order condition implies constant 
expected consumption over time, E,{C_}= 
C, for all z>t.* Taking expectations on 
both sides of the integral budget constraint, 
we obtain 


(4) C,=Y+rA,- rf dz. 
t 


The government’s intertemporal budget 
constraint implies 


(5) [ dz B, +f dz 


t 


where {G_} is the government spending pro- 
cess and B, is government debt (i.e., cumu- 
lated past deficits). Substituting (5) into (4) 
we obtain 


(6) C,=Y+r(A,-—B,) 


- rf dz. 
t 


Our interest is in the inverse relation between 
current consumption and expected future taxes for a 
forward-looking consumer. This relation is obviously 
general; we use quadratic utility to derive a simple 
functional form. 
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We note at this point that A, — B, equals 
the country’s net foreign assets, since we do 
not consider capital accumulation. Dividing 
through by output, we obtain a simple in- 
verse relation between the consumption-to- 
output ratio and the present discounted 
value of the ratio of future government ex- 
penditure to output, given the ratio f, of 
net foreign assets to GDP: 


(6a) 


where v, denotes the present discounted 
value of the ratio of expected future govern- 
ment spending to output; that is, 


(1) y= Efe je re Pde. 


Expectations are unobservable as such. 
To obtain a relation between (observable) 
consumption and government spending, we 
need to specify a process for the latter and 
to find how expectations of its present dis- 
counted value depend on current observ- 
ables. A number of state variables may be 
relevant for this purpose. In general, how- 
ever, any movement in current government 
spending must imply a change in future 
spending or taxation to ensure sustainabil- 
ity. To see this, consider a case in which 
changes in current government spending do 
not give rise to expectations of future off- 
setting movements: suppose that the ratio 
of government spending to output, denoted 
by g,, follows a random walk (in continuous 
time, for analytical convenience), so that 


(8) dg, = + dW, 


where {W,} is a standard Wiener process.* 
We assume the drift # to be strictly posi- 
tive, reflecting the tendency of government 
outlays relative to GDP to rise over time in 
the absence of feasibility concerns. If equa- 


>Modeling g, aS a (regulated) Brownian motion, we 
can use the technical apparatus developed in the litera- 
ture on exchange-rate bands (see the papers in Paul 
Krugman and Marcus Miller [1992] and Bertola [1993] 
for a survey). The application of this approach to issues 
of fiscal sustainability was considered by Miller et al. 
(1990) and Bertola (1990). 
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tion (8) always described the dynamics of 
{g,}, we would have E{g.}=g,+ 0(z 1) 
for all t, all z>¢, and integration of (8) 
would yield 


(9) 


Using (9) in (6a), we see that when g dis- 
plays no tendency to reverse its upward 
movements, changes in current government 
spending imply changes in expected future 
taxation in the same direction and, hence, 
movements in current consumption in the 
opposite direction. 

With g following a random walk, how- 
ever, the present discounted value of future 
government spending and taxes would ex- 
ceed the present discounted value of output 
with probability approaching 1. Government 
solvency, however, requires that continuing 
increases in current government spending 
must eventually be slowed or reversed. Such 
stabilizations of an upward trend in the 
ratio of government spending to output of- 
ten have a discrete character: there are 
infrequent sharp changes in government ex- 
penditure patterns, with a sharp cut in gov- 
ernment spending or its rate of increase 
occurring only after a significant upward 
drift, as exemplified by the Danish and Irish 
experience reviewed above. The _infre- 
quency of fiscal regime changes may reflect 
political constraints blocking agreement on 
fiscal retrenchment, which can be reached 
only when the ratio of government expendi- 
ture to output reaches levels that are suffi- 
ciently high to be deemed critical. (The 
argument that the enactment of an in- 
evitable stabilization may require the pre- 
stabilization situation to deteriorate 
markedly is explored in Alberto Alesina and 
Drazen [1991] and Drazen and Vittorio U. 
Grilli [1993].) 

We proceed to model discrete interven- 
tions by assuming that the ratio of gov- 
ernment spending to output follows the 
dynamics in (8) but jumps downward 
discontinuously when it reaches excessively 
high levels, terming such discrete shifts 
stabilizations. For simplicity, we let all stabi- 
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lizations result in g, dropping to a given 
common-knowledge level g., and we as- 
sume that stabilizations may occur either 
when g, = g. or when g, = g.’ Here, 2 rep- 
resents a ratio of government spending to 
output sufficiently high to trigger a stabiliza- 
tion with certainty, and g, is a lower value 
which the government may or may not con- 
sider crucial. We denote the probability of a 
stabilization when g,=g, with p, and we 
let p= p, initially. If the stabilization does 
not take place when g, reaches g, for the 
first time, however, the public recognizes 
that g. is not considered “too high” by the 
government. Thereafter, g. is no longer 
deemed critical, and p=0: if the noise 
component in (8) brings the spending-to- 
output ratio back to g,, a stabilization is 
assigned zero probability. The inevitability 
of an eventual stabilization follows from the 
fact that the Brownian-motion dynamics 
would bring the government’s tax obliga- 
tions to infinity with probability 1. Thus, if 
g, goes past g., 1. eventually drops back to 
g. with probability 1 when it reaches g. At 
that point, p is reset to its initial value p,, 
implying that g. is once again a candidate 
for a stabilization. We further assume that 
there is no analogous feasibility problem for 
low values of g,, which are not likely to be 
observed anyway given the positive drift. 
The modeling of stabilizations is not 
meant to be fully realistic, of course, but we 
believe that its assumptions do capture 
politico-economic phenomena of some in- 
terest. The public’s uncertainty about 
whether a specific value g. is high enough 
to induce an agreement reflects the same 


constraints that rationalize the existence of 
the difficulty of 


trigger points, namely, 
reaching political agreement on fiscal re- 
trenchment. When no stabilization takes 
place at a potential trigger point, it is real- 
ized that the threat of fiscal unsustainability 
is not strong enough to induce agreement 
under current circumstances; if stabilization 


*More generally, there may be a number of possible 
trigger points or return points. This simple case illus- 
trates the main results. An alternative approach would 
be to focus on the inference problem of discovering the 
true process generating government spending for an 
observed time series, as suggested in Drazen (1990). 
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does occur at g., on the other hand, there 
is no guarantee that the same point will 
necessarily induce agreement the next time 
it is hit. The assumption that lower values 
of g are once again possible critical points 
after a drastic stabilization at g is appealing 
on theoretical grounds, as it ensures that 
the economy repeatedly returns to the two 
possible situations and that (if the economy 
under consideration were observed for a 
long enough time) initial conditions would 
eventually become irrelevant. It may be ra- 
tionalized considering that, if government 
spending has been allowed to drift off, this 
experience may provide more political will 
not to let it happen again. 


III. Solution 


The model outlined above admits a 
closed-form solution. Stabilizations occur 
(with given probability) only when the ratio 
of government expenditures to output 
reaches “trigger” values, and the dynamics 
between stabilizations are given by the pro- 
cess (8) which is Markov in levels. The like- 
lihood of stabilizations at every time in the 
future is then determined, as of time f, by 
the current level of g, only, and {g,, p,} 
form a bivariate Markov process. All condi- 
tional expectations in the integral (7) defin- 
ing v, are then functions of g, and p,, to 
imply that v, = v(g,; p,). 

By Ito’s lemma, whenever {g,} follows 
Brownian-motion dynamics (and, by our as- 
sumptions, p, is not changing), we have 

l 

E{ dv} = dvu'(g,;p,)+ 85 
where the primes denote the partial deriva- 
tives of with respect to g. From the 
differential form of (7), 


(7a) rv, = 8, + —E,dv,} 


t 


dt 


we obtain a_ differential 


U(g,; p,): 


(10) 


U"( 8,521) 


equation for 
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which must be satisfied by the v(-) function 
for all values of g for which the probability 
of a stabilization is zero. The expression in 
(9) is a particular integral of (10), corre- 
sponding to the (unsustainable) case in 
which stabilizations never occur. All solu- 
tions of (10) are then linear combinations of 
(9) and of solutions of its homogeneous 
part, in the form 

(11) h(g) = + K,e°8 

where a, and a, solve the characteristic 
equation 


1 
+d8a—r=0. 


(12) 


The roots of this equation are real, distinct, 
and of opposite sign if ra* > 0. 

The two terms in (11) reflect the upward 
or downward bias in the discounted expec- 
tation of future spending due to the possi- 
bility of upward or downward jumps. By the 
assumptions above, no discrete upward 
jumps in government spending are triggered 
by “too low” values of g. Hence, the likeli- 
hood of future stabilizations becomes 
smaller and smaller as g approaches —»%, 
and it must be the case that 


lim h(g)=0. 


This implies that if a, is the negative root 
of (12), then the solution sets K, in (11) to 
zero, while the current probability of stabi- 
lization at point g, will be reflected in the 
value of the constant of integration associ- 
ated to the positive a root, which we will 
denote K(p). Combining (9) and (11), solu- 
tions of (10) must then take the form 


(13) vU(8,3P) K( p)e**: 
where 


= 5 > 


o- 


Qa 
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The model under consideration allows for 
two possible values of p,: p., when g, is a 
possible stabilization point, and 0, when it is 
not. At times when a stabilization may oc- 
cur, v(g) must satisfy the boundary condi- 
tions implied by the requirement that the 
expectation of future government spending 
not be expected to change (i.e., the law of 
iterated expectations). Consider first the 
case in which g, has passed through g, 
with no stabilization, so that we are consid- 
ering the function v(g,;0). Since g drops to 
g, with probability 1 whenever it reaches g, 
the present discounted value of expected 
future taxes must be identical at g and g,; 
that is, 
(14a) v(8;0) =v( )- 

Similarly, when stabilization at g, is per- 
ceived as possible, there can be no expected 
jump in v(g; p). Thus, it must be true that 


(14b) 
Pc) = DU( 855 Pe) 


+(1- p,)v(g,30) 


when the expenditure-to-output ratio 
reaches the critical value g, and the public 
is uncertain ex ante as to whether a stabi- 
lization may occur at that point. 

Using (13) and (14), we find that K(p,) 
and K(0Q) must solve the system 


][K(p.) 


2.)/* 


This yields values for K(p,) and K(0) and, 
hence, solutions for v(g,; p,) and v(g,;0) in 
(13). Quite intuitively, both K values are 
negative, reflecting the expectation of fu- 
ture downward jumps in the level of g. 

To summarize, the changes in future gov- 
ernment spending necessary to ensure sol- 
vency in response to increases in current 
government spending imply that a current 


(15) 
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Ficure 5. Mope : r = 0.05, o = 0.006, 3 


increase in the government spending-to- 


output ratio, in general, has nonlinear ef- 
fects on the expected present discounted 
value of the ratio of future government 
spending to output. It may therefore be 
misleading to use simple VAR’s to predict 
future government policies, as was done for 
example by Giavazzi and Pagano (1990). 
The mean reversion in government spend- 
ing which characterizes stabilization has the 
further implication that the present dis- 
counted value of expected future g, rises 
less and less as g, rises. One should note 
that this nonlinearity does not depend on 
intervention being discrete. A number of 
other models would have similar implica- 
tions: for example, stabilizations may be 
modeled as discrete changes in the drift 3 
of g,, or interventions may be infinitesimal 
rather than discrete. Nor need interventions 
be infrequent, as long as the present dis- 
counted value of future government spend- 
ing increases less fast than g, itself. What is 
crucial is mean reversion, with increases in 
g making it more likely that it will come 
back to some return point. 


g 


0.0045, p. = 0.5, f =0 


IV. Observable Implications: Consumption 
and the Current Account 


We can now study the relation between 
observable changes in government spending 
on the one hand and observable changes in 
consumption and the current account on 
the other. Using the results of the previous 
section we may write (6a) as 
(16) c,—1f, =1—rv(g,3p).- 
Theoretically, f, is a function of the real- 
ized history of {g,}; over short intervals, 
however, f will be roughly constant since 
changes in f represent the cumulated effect 
of flow consumption decisions from (16). 
Over longer intervals, changes in f will 
complicate the relation of c and g in Figure 
5 but will not change the underlying charac- 
teristic, namely, the flattening out for high 
values of g, which is the object of our 
interest. As we do not have information on 
f, we choose to concentrate on the relation- 


ship of c to g, and to work with equation 
(16) for given f. 
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We graph the relation given by (16) in 
Figure 5, choosing parameter values that 
are realistic for the Danish and Irish experi- 
ence. The upper, solid curve shows the be- 
havior of consumption for g,<g, when 
there is probability p,>0 that a discrete 
policy shift will occur when g, hits g,.. The 
lower, dashed curve shows the behavior of 
consumption when there is no probability of 
a discrete change in government spending 
at g.. The diagonal dashed-dotted line is a 
45° line corresponding to a zero trade bal- 
ance. 

Using the solution from (13) for v(g,; p) 
in (16) and differentiating, the slope of the 
relation between c and g is 


dc 
08, 08, 


=—i—rak( pye** 


at points where v(g,;p) is differentiable, 
that is, at all points other than g, along 
v(g,; pp.) and g along v(g,;0). [Note that for 
a change in g occurring in infinitesimal 
time f is constant to first-order; hence, (17) 
is strictly correct.] 

Several points should be noted. First, if 
discrete stabilizations were ruled out, move- 
ments in g, would be matched by equal-size 
movements in c, in the opposite direction 
[see equation (9)]: we would then have 
dc, /0g,=—1, and c and g would move 
along a 45° line in Figure 5. When discrete 
cuts in spending are included, the relation- 
ship of c — rf to g is flatter than a 45° line, 
and may even slope upward. At low levels 
of g, c falls less than dollar-for-dollar with 
increases in current government spending: 
this of course reflects the fact that increases 
in current government spending imply in- 
creases in the probability of a future dis- 
crete spending cut and hence a lower 
increase in future expected taxes than in 
the no-expected-intervention case. Thus, 
higher government expenditure only par- 
tially crowds out consumption, and the 
model—in spite of its extreme neoclassical 
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structure—has distinctly Keynesian observ- 
able implications. With fixed output, the 
difference between the change in consump- 
tion and government spending is reflected 
in the current account. 

Consider now the upper curve. When g, 
is reached at point A, there is a probability 
p, of a discrete cut to level g, of the ratio of 
government spending to output. If the cut is 
realized, we jump to point D: consumption 
jumps up, reflecting the discrete fall in the 
present discounted value of future taxes. 
Note however that consumption rises by less 
than government spending falls. Alterna- 
tively, with probability 1— p,, there is no 
stabilization when we hit A. This indicates 
that g. was not a critical level of spending 
for the current government, so that ex- 
pected future spending and thus expected 
future taxes are discretely higher. Hence 
consumption falls discretely, as indicated by 
the jump to point B on the lower curve. 
That is, a small increase in current govern- 
ment spending may induce a large, discrete 
fall in consumption. Put another way, the 
expectation that a stabilization is possible at 
some perceived critical value will be con- 
tractionary if the stabilization does not 
materialize. Changes in the perceived prob- 
ability of a discrete cut in government 
spending will have a similar effect, generat- 
ing discrete jumps in the consumption-to- 
output ratio with little change in the ratio of 
government spending to output. 

Along the lower curve, stabilization is ex- 
pected to occur with certainty when and 
only when the ratio of government con- 
sumption to output reaches g. This curve 
also bends away from the 45° line, the bend 
being progressively greater for higher values 
of g,. For g, sufficiently high, increases in 
g, induce increases in c, (again, for given 
f). This is because at high levels of g, 
further increases in government spending so 
increase the probability of a discrete cut in 
future government spending as to lower the 
present discounted value of future expected 
taxes. This result is reminiscent of the 
inflation-overshooting result found in 
Drazen and Helpman (1990) and occurs for 
basically the same reason. 
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The response of the current account to 
changes in government spending is, of 
course, fully determined by the above re- 
sults since output is fixed. To derive this 
more formally, we first write the ratio of the 
current account to output, denoted x,, as 
Differentiating with respect to g, and using 
(17), we obtain, for values such that v(g; p) 
is differentiable, 


OX, 
—=raK(p)e**. 


og, 


(19) 


Since K(p) is negative this derivative is 
negative and increasing in g,, so that along 
a curve the current account always worsens 
with increases in g,. This is identical to the 
above observation that the c,—~rf, line 
bends away from the 45° line, with the bend 
being greater the larger is g,. 

We quickly note some further features of 
movements in the current account. At point 
A, an improvement in the current account 
occurs with probability 1. If a stabilization 
occurs (implying a move to D), c, rises by 
less than g, falls, so the current account 
improves. If no stabilization takes place, we 
move to B, c, falls with no change in g,, 
and the current account improves. Similarly 
along the lower curve a stabilization at g 
induces a current-account improvement. 
Note further that, along the upward-sloping 
portion of the c(g,;0) curve, increases in g, 
induce large deteriorations in the current 
account, as private consumption rises in an- 
ticipation of a stabilization. 


V. Discussion and Extensions 


In this section we compare the impli- 
cations of the simple model with the ex- 
perience of Denmark, Ireland, and other 
countries; to the extent that they do not 
match, we suggest extensions of the model. 

Recalling the evidence reviewed in Sec- 
tion 1, our model appears capable of ex- 
plaining the qualitative features of Danish 
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and Irish consumption data; for realistic 
parameters, the quantitative match between 
theory and data is satisfactory as well. Fig- 
ures 3 and 4 (where dashes plot a 45° line) 
display a nonlinear relation between the 
ratio of private consumption to output and 
the ratio of government consumption to 
output which is quite similar to the theoreti- 
cal one in Figure 5. The main puzzling 
feature of the Danish and Irish data, namely, 
the sharp drop in government spending with 
no similar change in private consumption at 
the time of stabilization (or the “expan- 
sionary effects of stabilization’’), is readily 
explained by the model on the assumption 
that the stabilization was anticipated. 

Our theoretical model was kept simple in 
order to make clear the general insight of 
the expectations approach to fiscal policy: 
solvency of the government and forward- 
looking consumer behavior imply that, in a 
neoclassical model, current consumption will 
fall less than one-for-one with increases in 
current government spending because 
changes in current government spending en- 
gender expectations of future fiscal changes 
in the opposite direction. As mentioned 
above, other models of the process generat- 
ing government spending would have simi- 
lar implications, as long as they maintained 
the characteristic of mean reversion that 
fiscal sustainability implies. 

The simple model we present has another 
feature that is consistent with the data, 
namely, the effect of changes in the per- 
ceived probability of a discrete cut in gov- 
ernment spending. The 1982 drop in 
consumption in Ireland is often attributed 
to the Keynesian effects of the sharp in- 
crease in taxes with the first stabilization 
attempt (Giavazzi and Pagano, 1990). Our 
model suggests how expectational effects 
could explain this drop. It has been argued 
(Rudiger Dornbusch, 1989) that the need 
for fiscal austerity was recognized already in 
1981. In our framework, then, the crucial 
event in 1982 was not the tax increase (which 
in a neoclassical model should have little 
effect on current consumption) but the ab- 
sence of a significant cut in g when this had 
come to be expected. According to the 
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FiGurE 6. BELGIUM: PRIVATE AND GOVERNMENT CONSUMPTION RATIOS 


Note: Two-digit numbers in the plot denote the years corresponding to each point. 


model, the failure of the cut to materialize 
at what had been perceived as a critical 
point should and did result in a drop in c 
with no change in g, and in a “failed” 
stabilization. 

Similar patterns of an inverse relation 
between c and g for low values of g which 
flattens out as g rises are observed else- 
where, for example, in Belgium (Fig. 6), 
Sweden (Fig. 7), and the United Kingdom 
(Fig. 8), all of which enacted programs of 
fiscal austerity in the 1980’s. (The dashed 
line is again a 45° line.) 

Plotting the ratio of the current account 
to GDP against g using Irish and Danish 
data yielded a significantly worse match of 
the model to the data than the plots of c 
against g (plots not shown). We will briefly 
discuss the possible reasons for the poor fit 


For Denmark, our model would similarly associate 
the sharp vertical drop in the consumption-to-GDP 
ratio in 1972 with a fall in the perceived probability of 
a government spending cut; we do not know whether 
this was indeed the case. 


between the simple model above and the 
data. 

First, the relation derived in (19) arises 
because of the accounting identity (18). 
Changes in the sum of c and g must be 
mirrored by changes in x in the opposite 
direction when output is constant. If invest- 
ment were added as the final component of 
output, the relation in (19) would apply to 
the derivative of the sum of investment and 
the current account (relative to GDP). The 
relation of this sum to g observed in the 
data for our sample of countries (not shown 
here) is indeed closer to that predicted by 
the model but contains no new information. 
Of course, this raises the issue of extending 
the model to include investment behavior. 
Changes in the anticipated fiscal environ- 
ment should strongly influence current in- 
vestment decisions, as for example the 
current investment boom in Mexico sug- 
gests. The precise effect of future fiscal 
policy on the forward-looking investment 
decisions of firms, however, would depend 
crucially on the nature of the adjustment 
costs they face. We plan in future research 
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FIGURE 7. SWEDEN: PRIVATE AND GOVERNMENT Co MPTION RATIOS 


Two-digit numbers in the plot denote the years corresponding to each point. 
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to model forward-looking investment behav- 
ior along with consumption decisions. The 
point of this paper, however, was to empha- 
size the general message of how expecta- 
tions of future fiscal shifts affect consump- 
tion in a nonlinear way, not to set out a 
complete macro model. The effects of ex- 
pected fiscal policy on consumption that we 
focus on would not be qualitatively changed 
by explicit consideration of investment. 
Second, a distinction between traded and 
nontraded goods could also help explain the 
poor fit of theory and current-account data. 
If we added nontraded goods to the model 
and assumed that the government pur- 
chased both types of goods, the results for 
consumption would be similar, but those for 
the current account could change signifi- 
cantly. As with tradables, an increase in 
government expenditure on nontraded 
goods would increase the probability of a 
future cut in government consumption of 
nontraded goods. With fixed output in each 
sector, market clearing implies that current 
private consumption of nontraded goods 


would fall by exactly the amount that gov- 
ernment consumption rose. Hence the sign 
of the consumption results would be unaf- 


fected, except perhaps in the upward- 
sloping region of the curve. For the current 
account, however, a future anticipated cut 
in government consumption of nontraded 
goods could induce a surplus rather than a 
deficit for any value of g, if traded and 
nontraded goods were complements (see 
Drazen and Helpman, 1987). That is, with 
fixed output, the spending cut implies an 
increase in consumption of nontraded goods 
and thus, via complementarity, increased 
consumption of tradables on the date the cut 
takes place. This effect taken alone implies 
that a current-account surplus will be run in 
anticipation of a stabilization in order to 
finance higher future expected consumption 
of tradables. The net effect might therefore 
be ambiguous. 

A further problem related to the poor 
match between theory and data is that, while 
the nondistortionary tax model implies that 
the relation between c and g should be 
everywhere flatter than the 45° line, the 
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data often display a steeper relation (for 
example in Denmark in the 1960’s and early 
1970’s). One way to explain this is that net 
foreign assets are falling over time, so that 
permanent income is below the present dis- 
counted value of the endowment flow minus 
the present discounted value of future taxes. 
A second mechanism that might explain this 
feature of the data is distortionary taxation. 
We therefore explore briefly how the theo- 
retical results would be affected if taxes 
were distortionary and current output de- 
pended negatively on the current level of 
distortionary taxation.° 

Consider first an increase in current gov- 
ernment spending with no increase in cur- 
rent taxes. Current output is unaffected, 
while current consumption will be “doubly” 
affected, relative to the earlier model. The 
increase in current government spending 
raises the expected value of future govern- 
ment spending and hence taxes. This re- 
duces consumption for a given level of 
permanent income, as before. In addition it 
reduces permanent income due to the dis- 
tortionary nature of (future) taxation. 
Hence, the fa!l in the ratio of consumption 
to output in response to an observed in- 
crease in the ratio of government spending 
to output will be greater than in the earlier 
model (or the rise will be smaller if we are 
on the unward-sloping portion of the curve). 
If this distortionary effect were strong 
enough, the relation would be steeper than 
the 45° line. A discrete cut in government 
spending would similarly cause a greater 
consumption boom in the case where it was 
not expected with certainty. 

At the other extreme, consider an in- 
crease in current taxes with no change in 
current government spending and hence ex- 


°Alternatively, current output could depend on the 
expected level of future distortionary taxation, which 
would arise from factor-accumulation decisions or in- 
tertemporal substitution in factor supply, suggesting 
the importance of carefully modeling investment. We 
believe this to be an important possibility but leave a 
formal analysis for a later paper. Paolo Manasse (1991) 
extends the model proposed here to allow for tax- 
smoothing behavior on the part of the fiscal authori- 
ties. 
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pected future spending, and suppose that 
the current level of taxes gives no informa- 
tion about the future level of taxes not 
already contained in government spending. 
In the previous model with Ricardian equiv- 
alence, this had no effect. In this model, 
total consumption C, is still unaffected, 
since permanent income is_ unaffected. 
There is, however, a transitory fall in in- 
come and hence a rise in the ratio c,. 
Therefore, a fall in the consumption-to- 
GDP ratio, such as we saw in Ireland in 
1982 after the tax-based stabilization at- 
tempt, requires a fall in the perceived prob- 
ability of government spending cuts in both 
the distortionary tax case and the nondistor- 
tionary tax case. An increase in distor- 
tionary taxes would imply an unambiguous 
worsening of the current-account-to-GDP 
ratio. 

To summarize, we would argue that the 
general implication of letting taxes be dis- 
tortionary in a neoclassical model is that 
reductions in the deficit via tax increases 
can have real effects, effects different than 
those arising from deficit reductions due to 
government spending cuts. In the latter case, 
private consumption will increase, while 
output will at worst remain unchanged and 
may rise if current output depends nega- 
tively on future expected taxes. In the for- 
mer case of tax increases with no change in 
expected government spending, output will 
fall, and consumption will at best remain 
unchanged and may fall if a government 
spending cut had been expected. The expec- 
tations view of fiscal policy therefore ap- 
pears to be capable of explaining the very 
different effects of government spending 
cuts and tax increases which Giavazzi and 
Pagano (1990) found to characterize numer- 
ous countries in the 1980's. 


VI. Conclusions 


We proposed a model in which the effects 
of fiscal policy on current economic activity 
depend on what sort of expectations of fu- 
ture policy are engendered, focusing in par- 
ticular on the consumption effects of ex- 
pected discrete changes in future fiscal 
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policy. A simple model in which the ratio of 
government spending to GDP follows regu- 
lated Brownian motion predicts a nonlinear 
relation between consumption and govern- 
ment spending, with jumps in the ratio of 
consumption to GDP at certain points. We 
observe this type of nonlinear relation be- 
tween the ratio of consumption to GDP and 
the ratio of government spending to GDP in 
Denmark, Ireland, and several other coun- 
tries. Other specifications for the process of 
government spending that capture the qual- 
itative nature of stabilizations would have 
similar effects. 

We believe that the results we obtain are 
more general than the specific application 
we present and that the simple model in 
this paper should be read as an illustration 
of a general phenomenon. The results indi- 
cate several directions for further research. 

The expectations view of fiscal policy in- 
dicates, among other things, that a neoclas- 
sical model may generate observable impli- 
cations that look quite Keynesian. In a 
nonmonetary closed economy with fixed 
output, an increase in government spending 
that induces the expectation of a future cut 
will cause interest rates to rise. If effort 
were endogenous, increases in g would im- 
ply expansions in output as consumers 
attempt to increase current income. It ap- 
pears feasible to specify and solve a two- 
country model in which increases in g at 
home induce increases in world interest 
rates and world output and a deterioration 
of the home country’s trade balance. On the 
empirical side, verification of the role of net 
foreign assets and distortionary taxation is 
important. 

It would of course be beneficial to de- 
velop a way of formally estimating and test- 
ing a model stressing fiscal expectations. 
Unfortunately, however, this is very diffi- 
cult, or even impossible, given the infre- 
quent nature of stabilizations or drastic 
changes in fiscal regimes and the short data 
span. However, the model does deliver pre- 
dictions that are in broad agreement with 
the experiences of a number of countries 
and, on a more general methodological level, 
points out possible problems with applying 
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simple time-series methods when economic 
considerations suggest significant nonlinear- 
ities in behavior. 
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Economic Policy, Economic Performance, and Elections 


By JoserH E. HARRINGTON, JR.* 


This paper explores the role of reelection pressures in determining economic 
policy. By assuming that voters are uncertain as to the efficacy of different 
policies, a reelection process is derived which depends on both a politician’s past 
policies and the level of economic activity during his time in office. It is shown 
that the less uncertain are voters as to which policy is best, the more likely is a 
politician to manipulate policy for reelection purposes. Manipulation entails 
implementing the policy that is more likely to be well received rather than the 
one that maximizes income. (JEL D78, E60) 


The president is a major player in the 
U.S. economy. His decisions with respect to 
fiscal and monetary policy, the regulation of 
markets, the enforcement of antitrust law, 
and the erection of trade barriers, for exam- 
ple, can have sizable economic effects. In 
light of the significance of presidential eco- 
nomic policy, it is important to understand 
the forces that underlie these policies. One 
factor that has drawn the attention of both 
economists and political scientists is the 
pressures of reelection. In principle, a 
first-term president can influence his reelec- 
tion prospects directly through the policies 
he selects and indirectly through the impact 
of these policies on economic performance. 
A president who strategically manipulates 
policy in order to improve his reelection 
prospects not only creates a policy bias in 
the current period but also affects future 
policy if he is successful in influencing the 
outcome of the election. 

In order to explore the incentives for the 
manipulation of policy, one must first de- 
velop a model of the reelection process. 
The two leading lines of research follow 
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sity, Baltimore, MD 21218. I thank Dan Klein, Stergios 
Skaperdas, two anonymous referees, and seminar par- 
ticipants at the University of Virginia for their com- 
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course, my own. This paper was completed while the 
author was on leave at Northwestern University. The 
hospitality of the Department of Management and 
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from the pioneering work of Anthony Downs 
(1957) and V. O. Key, Jr. (1966).' Downs 
argued that an incumbent’s success in bid- 
ding for reelection is largely determined by 
his past policies and actions. That is, voters 
look at the past decisions of the incumbent 
and make inferences about the decisions he 
will make in the future. In contrast, Key 
argued that voters largely ignore the past 
decisions of politicians and instead focus on 
measures of performance; in particular, 
economic performance. 

Recent related research in economics and 
political science can be placed in either of 
these two camps. Past policies are modeled 
as the dominant factor in elections in the 
work of Kenneth Rogoff and Anne Sibert 
(1988), Rogoff (1990), W. Robert Reed 
(1989a,b), and Gregory D. Hess (1991). For 
example, Rogoff and Sibert (1988) and 
Rogoff (1990) assume that the incumbent’s 
level of competency is private information, 
where greater competency means an ability 
to generate higher revenue or a higher level 
of government services. The incumbent 
chooses monetary policy (Rogoff and Sibert, 


Early research includes William D. Nordhaus (1975) 
and C. Duncan MacRae (1977) on the political busi- 
ness cycle and Morris P. Fiorina (1981) and Douglas A. 
Hibbs, Jr. (1987), which build on Key’s idea of retro- 
spective voting. Also, see M. Daniel Bernhardt and 
Danie! E. Ingberman (1985), Alex Cukierman and 
Allan H. Meltzer (1986), and John R. Lott, Jr. and 
W. Robert Reed (1989). In contrast to work to be 
reviewed later, these analyses do not formally model 
voters so that voting behavior must be specified ad hoc. 
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1988) or fiscal policy (Rogoff, 1990) and 
thereby influences some measure of eco- 
nomic performance. The outcome of the 
election is shown to depend only on the 
incumbent’s past policies and is indepen- 
dent of economic performance. 

A second line of research follows the lead 
of Key (1966). John A. Ferejohn (1986), 
David Austen-Smith and Jeffrey S. Banks 
(1989), and Alberto Alesina and Alex 
Cukierman (1990) specify that the incum- 
bent’s actions or policies are not observed 
by the electorate. As a case in point, Alesina 
and Cukierman (1990) assume that the in- 
cumbent’s policy preferences are private in- 
formation. Since observed economic perfor- 
mance is a noisy signal of the incumbent’s 
underlying policy, voters use performance 
to draw inferences about the incumbent’s 
current policy and thus about his future 
policy intentions. However, even in this class 
of theories, if the incumbent’s policies were 
observable then voting would depend only 
on policy and thus be independent of per- 
formance.” 

Though there is minimal empirical work 
that takes account of the electoral impact of 
a president’s policies, there is considerable 
empirical support for the hypothesis that 
the performance of the economy affects the 
outcome of the election. Summarizing the 
evidence, D. Roderick Kiewiet and Douglas 
Rivers (1985) find that a 1-percent decline 
in real income is associated with a reduction 
of the incumbent party’s vote share of be- 
tween 0.5 percent and 1 percent. In inter- 
preting this empirical regularity, existing 
theories would argue that the performance 
of the economy is simply acting as a proxy 
for the president’s economic policies. The 
implication is that if the relevant policy in- 
struments were observable and controlled 
for in the empirical specification, the signif- 
icant role of economic performance would 
disappear. 


*Banks and Rangarajan K. Sundaram (1990) con- 
sider a model of adverse selection in which incumbents 
differ according to their competency but do not make 
decisions. Obviously, performance plays an important 
role in that setting. 
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In this paper, I develop a theory that 
provides an alternative explanation for the 
empirical relationship between economic 
performance and electoral outcomes. It is 
argued that measures of economic perfor- 
mance are of intrinsic interest to voters and 
are not simply acting as proxies for the 
president’s economic policies. An electoral 
process is endogenously derived which de- 
pends on both the (observable) economic 
policies of the incumbent and the perfor- 
mance of the economy during his time in 
office. The key modeling innovation is to 
allow voters to be uncertain as to which 
economic policy is more productive in gen- 
erating income. Furthermore, voters and 
politicians can have different opinions as to 
which policy is best. Thus, voters are uncer- 
tain about which policy is best and about 
the incumbent’s future policy intentions, as 
they are uncertain of which policy the in- 
cumbent believes is best. We might add that 
preceding research has ignored this source 
of voter uncertainty, as voters’ policy prefer- 
ences are treated as primitive. In fact, pol- 
icy preferences are derived from one’s pref- 
erences on income and one’s beliefs over 
the effectiveness of different policies. An 
important implication is that voters’ policy 
preferences can change over time in re- 
sponse to new information about the effi- 
cacy of different policies. The endogeneity 
of voters’ policy preferences will play a cen- 
tral role in the analysis.° 

One of the virtues of such a theory is that 
it can provide insight as to when elections 
will be largely influenced by policy and when 
they will be largely influenced by perfor- 
mance. We find that voting behavior and 
thereby electoral outcomes are more sensi- 
tive to the policy chosen, the more confi- 
dent are voters in their beliefs as to which 
policy is best. Voting behavior approaches 
being performance-based, as in Alesina and 


>Though, to the best of my knowledge, this is the 
first electoral model to allow for uncertainty over the 
mapping from policies to outcomes, this type of uncer- 
tainty has been allowed for in legislative models (see 
Austen-Smith and William Riker, 1987; Thomas W. 
Gilligan and Keith Krehbiel, 1989). 
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Cukierman (1990), when voters become 
more uncertain as to which policy is best, 
and it approaches being policy-based, as in 
Rogoff (1990), when voters become less un- 
certain. 

Whether or not electoral outcomes are 
dominated by policy or performance turns 
out to be an important determinant of the 
policy decisions of elected officials. My 
analysis shows that the more sensitive are 
electoral outcomes to policy, the greater is 
the extent of policy manipulation by the 
incumbent for the purposes of reelection. 
Specifically, the incumbent is more likely to 
implement the policy that will be well re- 
ceived and less likely to implement the pol- 
icy that will maximize income. This behavior 
not only creates a bias in current policy but 
also results in less than desirable incum- 
bents being reelected, thereby influencing 
future policy. 


I. The Model 


Consider an elected office in which it is 
the responsibility of the officeholder to se- 
lect and implement a policy that affects the 
income of his constituents. I assume that 
the elected official chooses from among just 
two policies, denoted policy | and policy 2. 
While this assumption is made for reasons 
of tractability, it has some appeal in that 
voters may find it difficult to make subtle 
distinctions between policies. For example, 
they may only take note as to whether or 
not taxes are raised. In this sense, policies 
may be quite broadly defined. In fact, they 
may be so broadly defined as to be more 
appropriately interpreted as ideologies; for 
example, extensive or minimal government 
intervention in the economy. 

Implementation of policy x, where x € 
{1,2}, generates per capita income y= 
w, +e where w, is the policy-specific com- 
ponent and ¢ is a random shock. All voters 
receive income y, so differential policy ef- 
fects are not issues. It is assumed that e is 
independently and identically distributed 
across time and policies and that ¢~ 
MN(0,0,2) where a, > 0 and finite. After vot- 
ers observe the policy and the realized level 
of income, the incumbent runs for reelec- 
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tion against a challenger. The winner imple- 
ments a policy, and the game ends after 
income is realized. With the exception of 
assuming that voters observe policy, this 
structure is identical to that specified in 
Alesina and Cukierman (1990). 

An incumbent is presumed to care about 
holding office and about the level of in- 
come, the latter because he is altruistic or 
his income is also y. The incumbent is risk- 
neutral with respect to income, and his util- 
ity is additively separable in income and 
holding office. Let k denote the value at- 
tached to holding office, referred to as 
“ego-rents” by Rogoff and Sibert (1988). 
The utility to the incumbent from being in 
office when income y is realized is then 
represented by @y+k where 6>0 and 
k >. His total payoff for the two-period 
game is simply the sum of the discounted 
single-period utilities where 5 €[0,1] is the 
discount factor. Voters are assumed to care 
only about income and are risk-neutral as 
well. The horizon is such that each voter 
will want to maximize his expected income 
so that voters’ time preferences do not play 
a role. 

A singular feature of the model is that I 
do not take voters’ policy preferences as 
being primitive. Instead I specify voters’ 
preferences over outcomes and their beliefs 
over the efficacy of policies 1 and 2. Voters 
are allowed to be uncertain as to which 
policy is more effective; this uncertainty is 
represented by having the policy-specific pa- 
rameters w, and w, be unknown. Though 
the income generated by a policy is the 
same across voters, voter heterogeneity in 
their policy preferences arises by allowing 
voters to have different beliefs as to the 
efficacy of policies 1 and 2. Voters’ beliefs 
on w, and w, are assumed to be normally 
distributed where the prior expectations on 
w, and w, are allowed to vary within the 
electorate. Thus, a voter of type (u,, >) is 
specified to have beliefs: w,~ /(u,,07) 
and where and fi- 
nite. Each voter’s beliefs are private infor- 
mation. 

The significance of endogenizing policy 
preferences is that policy preferences are 
not constrained to be fixed over time. In 
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particular, if voters receive new information 
about the efficacy of either or both policies, 
their beliefs on (w,,@,) can change and, 
along with them, their policy preferences. 
The endogenous nature of voters’ policy 
preferences will play a central role in the 
analysis. 

In specifying the type-space of voters, one 
needs to consider the ways in which voters’ 
beliefs might differ. Two sources of hetero- 
geneity in voters’ beliefs become immed- 
iately apparent. First, voters could have 
different beliefs as to which policy will gen- 
erate higher income on average. Second, 
voters could differ in terms of their opti- 
mism; that is, one voter could expect in- 
come to be higher under both policies than 
does another voter. In order to prevent too 
many sources of voter heterogeneity from 
confounding the analysis, I have decided 
only to allow voters to differ in their beliefs 
over which policy is best. I chose it because 
I believe it is the more important of the 
two. 

Following through on the preceding dis- 
cussion, the type space must be specified so 
that no voter type is larger than another 
voter type in both components. This means 
that the type space is represented by a 
decreasing function in R? so that it is of the 
form {(u,¢(u))|u A} where o(-): A> 
B,¢(-) is a strictly decreasing function, and 
A,BCR. I will specifically assume that A = 
B =[0,1] and (-) is linear. Thus, the type 
space is of the form {(u,1— )|u <[0, 1]} so 
that a single parameter represents a voter’s 
type. I conjecture that all results are robust 
to allowing ¢(-) to be any differentiable 
decreasing function. 

To summarize, since y=@,+e and 
e~ V(0,02), all voters believe that in- 
come is normally distributed with variance 
a. Where voters disagree is over expected 
income (denoted w, for policy x). A type-w 
voter’s beliefs over expected income from 
policy 1 are represented by /(u,o7) and 
from policy 2 they are represented by 
MN(1—,07). Note that a type-w voter ini- 
tially believes policy 1 to be more effective 
than policy 2 if and only if > 5. 

Like voters, politicians have beliefs over 
(w,,@>5) which are private information. In 
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contrast to voters, however, the politician is 
assumed to be rather dogmatic in that a 
type-a incumbent believes with certainty 
that (w,,@,)=(a,1—a), a €[0,1]. Hence, 
a type-a incumbent believes that income is 
normally distributed with mean a when pol- 
icy 1 is implemented and mean 1— a@ when 
policy 2 is implemented, with a variance of 
o,. This specification of politicians’ beliefs 
considerably simplifies the analysis and al- 
lows me to focus on the implications of 
voters’ policy preferences being endoge- 
nous. What is critical to the analysis is that 
the policy that a politician believes to be 
most effective does not change over the 
course of his time in office. At a minimum, 
it seems plausible that voters are relatively 
more uncertain than politicians about the 
effectiveness of different policies. 

Since each agent’s type is private infor- 
mation, it is necessary to specify each agent’s 
beliefs over the other agents’ types. To be- 
gin, think of the electorate as being defined 
over the type space [0, 1]. Assume that there 
is either a finite and odd number of voters 
or a continuum of voters so that the median 
voter exists. Later analysis will show that 
the median-voter theorem applies so that 
the incumbent need only be concerned with 
the beliefs of the median voter. It is as- 
sumed that the median voter’s type is ran- 
domly drawn from [0,1] according to the 
differentiable cumulative distribution func- 
tion G(-): [0,1]-[0,1]. Thus, the incum- 
bent’s prior beliefs on the median voter’s 
type are represented by G(-). The incum- 
bent’s type as well as the challenger’s type 
are independent random draws from [0,1] 
chosen according to the continuous cumula- 
tive distribution function F(- ): [0,1] — [0,1]. 
Note that a type-a candidate believes that 
policy 1 is more effective if and only if a€ 
(5,1). Let f =1-F(5) so f is the prior 
probability that a candidate believes that 
policy 1 is more effective. I assume that 
f €(0,1). Voters’ prior beliefs on a politi- 
cian’s type and the incumbent’s prior beliefs 
on the challenger’s type are represented by 
F(-). 

The key features of the informational 
structure are that voters and politicians have 
different beliefs over the effectiveness of 
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different policies and that this is common 
knowledge. Under the common prior as- 
sumption, these assumptions are inconsis- 
tent (Robert J. Aumann, 1976). Hence, a 
rationale for the informational structure is 
that agents are endowed with distinct prior 
beliefs. The heterogeneity in prior beliefs 
may be due to genetic or cultural differ- 
ences (e.g., the biases of one’s parents) but 
are not the result of differences in informa- 
tion. While agents differ in terms of the 
formation of their priors, they all receive 
the same information over time so that no 
agent believes he can learn anything from 
what another agent believes. While this is 
clearly an abstraction, what is important is 
that it is possible for agents to know that 
their beliefs are different and yet have no 
inclination to adjust them. Perhaps it is 
more appropriate to refer to these beliefs as 
“opinions,” since people regularly agree to 
having different opinions.* Semantics aside, 
this structure is admittedly unorthodox. 
Nevertheless, I have chosen to explore it 
because I believe that its features are fun- 
damental and prevalent in many economic 
and political settings. At a minimum, it is a 
quite plausible structure and would appear 
to be consistent with some observed behav- 
ior (e.g., betting on sporting events between 
individuals).° 

In concluding this section, I will describe 
the form of players’ strategies. To begin, the 
strategy of the incumbent tells him which 
policy to implement in his first term given 
his beliefs. Hence, the incumbent’s strategy 
is of the form x(-): [0,1] {1,2}. I do not 
formally model the policy choice of the 
elected official in the second period, as it is 
known that optimality requires that he im- 


“The prevalence of differences of opinions among 
economists over optimal economic policies is docu- 
mented in Bruno S. Frey et al. (1984). 

>In models of rational agents, it is standard to 
assume that an agent knows all of the implications of 
his knowledge (see Jakkoo Hintikka, 1975; Barton L. 
Lipman, 1991). When this is not true, it is possible for 
an agent to believe something that is logically inconsis- 
tent with his body of knowledge. In that case, it is easy 
to imagine it being common knowledge among agents 
that they hold different beliefs, for they could all derive 
different implications from the same information. 
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plement the policy which he believes will 
maximize expected income.® The decision 
faced by a voter is whether to cast his vote 
for the incumbent or for the challenger. At 
the time he casts his vote, a voter knows his 
own beliefs on (w,,,), the policy chosen 
by the incumbent, and the realized level of 
income. A voter’s strategy is then of the 
form ¢(-): [0,1] {1,2}*R—{I,C} where I 
denotes “incumbent” and C denotes “chal- 
lenger.” To break indifference, I assume 
that the voter who is indifferent between 
the incumbent and the challenger will vote 
for the incumbent. This is easily justified by 
assuming that there are some small benefits 
from seniority or some small transactions 
costs associated with transition. 


II. Optimal Voting Behavior 


| begin by characterizing a _ perfect 
Bayesian equilibrium in which the incum- 
bent implements a policy if and only if he 
finds it sufficiently effective. In this section, 
I derive optimal voting behavior when the 
incumbent uses such a policy rule. In Sec- 
tion III, I show the existence of such an 
equilibrium and characterize the bias in pol- 
icy created by reelection pressures. Then, in 
Section IV, I derive necessary and sufficient 
conditions for this equilibrium to be the 
unique solution and go on to explore the 
determinants of policy. 

When the incumbent implements a policy 
if and only if he finds it to be sufficiently 
effective, his strategy takes the following 
general form, where a* is an undetermined 
variable: 


2 ifa€[0,a*] 


For now suppose that a* €(0,1). Recall 
that a type-a incumbent expects policy 1 (2) 
to generate income of a (1— a). Hence, the 


’The optimal policy for the elected official in the 
second period is well defined, generically, when @ > 0. 
When @=0, I assume that candidates have lexico- 
graphic preferences over tenure and income. 
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higher is a, the more effective the incum- 
bent finds policy 1 vis-a-vis policy 2. 

Voters conjecture that the incumbent uses 
the policy function in (1). Thus, in response 
to observing that the incumbent imple- 
mented policy 1 in his first-term, voters will 
infer that the incumbent’s type lies in (a*, 1]. 
Recall that, if reelected, the incumbent will 
implement policy 1 if and only if a >} (i.e., 
he will act to maximize expected income 
in his final term in office). As a result, vot- 
ers assign probability [1 — F(max{$, a*})]/ 
[1— F(a*)] to the incumbent implementing 
policy 1 if reelected, given that he imple- 
mented policy 1 in his first term. Note that 
no information is revealed about second- 
term policy when a* = 0 (as all incumbent 
types implement policy 1) so that voters’ 
posterior probability is the same as their 
prior. Alternatively, if a* >45 then voters 
know for certain that, if the incumbent im- 
plemented policy 1 in his first term, he will 
do so again if reelected. 

Since voters assign probability 1— F(4) to 
the challenger implementing policy 1, im- 
plementation of policy 1 in his first term 
indicates that the incumbent is more likely 
than the challenger to implement policy 1 in 
the future. As a result, when x = 1, a voter 
votes for the incumbent if and only if he 
desires policy 1 to be implemented again. 
By a similar argument, one can show that 
when x = 2, a voter votes for the incumbent 
if and only if he prefers policy 2. Therefore, 
when the incumbent uses the policy rule 
x*(-), voters tend to associate the incum- 
bent’s future policy intentions with his past 
policies. This is reflected in voting behavior 
in that a voter votes for the incumbent if 
and only if he desires to have the same 
policy implemented again. 

Having just derived voters’ posterior be- 
liefs on the incumbent’s future policy inten- 
tions, the next step is to derive their poste- 
rior beliefs on the effectiveness of different 
policies. Suppose that policy x is imple- 
mented and y is realized. Since y=, + €, 
income is then a noisy signal of w,. Let 
Elw,\y, 4] denote a type-u voter’s posterior 
expectation of w, when policy x was imple- 
mented and income was y. Under the nor- 
mality assumptions, Bayes’ Rule yields the 
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following simple form (Morris H. DeGroot, 
1970): 


(2) Elo ly,u]=py+(-p)u 
(3) 


where p=o0*/(o* +0,). 

It is known that a voter votes for the 
incumbent if and only if he desires the 
incumbent’s first-term policy to be imple- 
mented in the next period. Using (2), this 
means that a type-u voter votes for the 
incumbent when x = 1 if and only if 


(4) 


> y>y =u +(2/p)0/2)- I. 


When instead x =2, a type-u voter votes 
for the incumbent if and only if 


(S) 


> y= you) -(1/2)I. 


Using (4)-(5) and letting x € {1,2} denote 
the incumbent’s first-term policy, the opti- 
mal voting rule is 


I ify>y,(n) 


(6) 
( ) C otherwise. 


According to this voting rule, a voter votes 
for the incumbent if and only if income (i.e., 
performance) is sufficiently great. Since 
yz) is generally different from y,(w), the 
standard to which an incumbent is held 
depends on the policy implemented (as well 
as on a voter’s prior beliefs). Thus, both 
economic policy and the resulting level of 
economic activity influence voting behavior. 
The reason is as follows. A voter votes for 


3 
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the incumbent if and only if he expects 
future income to be higher when the incum- 
bent remains in office. A voter’s beliefs on 
future income depend on his beliefs as to 
the policy that will be implemented and 
how effective that policy will be. The incum- 
bent’s past policy is instrumental in that it 
provides information as to the incumbent’s 
future policy intentions while past perfor- 
mance provides information as to the effec- 
tiveness of the incumbent’s policy. As a 
result, policy and performance jointly deter- 
mine voting behavior in that a voter associ- 
ates the incumbent with the policy he im- 
plemented and thereby votes for the 
incumbent if and only if that policy per- 
formed sufficiently well. 

A noteworthy property of this voting rule 
is that a voter may vote for the incumbent 
even though he was formerly against the 
incumbent’s policy. For example, if > 4, 
then a voter initially believes policy | is best 
but would vote for an incumbent who im- 
plemented policy 2 as long as y > y,(w). In 
response to exceptional performance by a 
policy, a voter will modify his policy prefer- 
ences. Note, however, that if a voter initially 
prefers, say, policy | then it takes greater 
income for him to vote for the incumbent 
when the incumbent implemented policy 2 
than when he implemented policy 1. Given 
the voter’s prior beliefs, he is more likely to 
attribute bad performance to a bad shock 
than to a bad policy (i.e., a low value of w,) 
when policy | is implemented as opposed to 
when policy 2 is implemented. 

Finally, one can use this voting rule to 
provide insight as to when voting behavior is 
relatively sensitive to policy. For this pur- 
pose consider the difference in the reelec- 
tion criteria: 


(7) 


= 2|(1/2) 


When y,(u)= y(n), a voter requires the 
same level of income to vote for the incum- 
bent irrespective of the policy implemented. 
Hence, voting depends only on _perfor- 
mance. As |y(z)— y(u)| increases, the 
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choice of policy has a bigger impact on the 
standard applied, so that voting behavior 
becomes more sensitive to policy vis-a-vis 
performance. According to (7), the more 
uncertain are voters as to which policy is 
best (the higher is o*) and the larger the 
impact of policy on income (the lower is 
o.), the more sensitive electoral outcomes 
are to performance. In both cases, perfor- 
mance is a more accurate signal of the ef- 
fectiveness of the incumbent’s policy. Since 
voters believe that the incumbent is rela- 
tively likely to implement the same policy if 
reelected, past performance is then a more 
informative signal of future performance. 
Note that as voters become totally confident 
in their beliefs about which policy is best 
(o° > 0), ly y(n)| so that voting 
becomes purely policy-based (Rogoff, 1990). 
Alternatively, if voters are initially indiffer- 
ent as to which policy is best (yu = +), voting 
is purely performance-based (Alesina and 
Cukierman, 1990). 


III. Equilibrium Policy Bias 


Suppose each voter uses the voting rule 
in (6) and, given x and y, the median voter 
prefers the incumbent. If x =1 then, since 
y,() is decreasing in p, all those voters 
whose type is greater than or equal to the 
median voter’s type will also vote for the 
incumbent. Hence, the incumbent wins. If 
x =2, then, since y,(y) is increasing in p, 
all voters whose type is less than or equal to 
the median voter’s type will vote for the 
incumbent, so that the incumbent wins. It is 
then true that a candidate wins if and only 
if he is preferred by the median voter. From 
this point onward, yu will denote the median 
voter’s type. 

When one says that a strategy profile is a 
perfect Bayesian equilibrium, it is meant 
that there exists a set of beliefs such that 
the strategy profile and that set of beliefs 
form a perfect Bayesian equilibrium. Theo- 
rem | shows that (1) and (6) comprise an 
equilibrium when the incumbent is suffi- 
ciently policy-motivated; that is, the 
marginal utility of income, 6, is large rela- 
tive to the incremental value to holding 
office, k. 
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THEOREM 1: If and only if 0/k is suffi- 
ciently large does there exist a* €(0,1) such 
that (1) and (6) is a perfect Bayesian equilib- 
rium. If a* exists, then it is unique. 


PROOF: 

Since (6) has been shown to be an opti- 
mal strategy V » €[0, 1], it is only necessary 
to show that (1) is a best response to t*(-) 
V a €[0,1]. To begin, let A(a, x) denote the 
probability that a type-a incumbent assigns 
to being reelected, given that he imple- 
ments policy x. In deriving A(-), first note 
that, if he implements policy 1, reelection 
occurs if and only if the median voter prefers 
policy 1, which occurs if and only if 
py+U-p)w>1- Substituting a +e 
for y (since a type-a incumbent believes 
y=a+e) and solving for ¢«, a type-a 
incumbent expects to be reelected if and 
only if «>(1/p)[1—(2— Letting 
H(-) denote the cumulative distribution 
function of ¢, it follows that 


(8) A(a,1) 


Therefore, a type-a incumbent’s expected 
utility from implementing policy 1 in his 
first term is 
(9) E[Ulae,1] 
=(0a+k)+ d{A(a, max{a,i—a}+k] 
+[1-—A(a,1)] 


where f is the probability that the chal- 
lenger implements policy 1. It is straight- 
forward to show that E[U|a,1] is strictly 
increasing in a since higher a@ increases 
expected income which is directly valued 
when @ > 0 and also raises the probability of 
reelection. Similarly, one can derive 


(10) E[Ula,2] 
=@(1-—a)+k 
+ d5{A(a,2)[@ max{a,l1—a}+k] 
+[1-—A(a,2)] 
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where 
A(a,2) 


1 
{1— p)w -1]+ a)}G'(u) dp. 
0 


E[U|a,2] can be shown to be strictly de- 
creasing in a. 
Define 


w(a) = E[U\a,1]— E[Ula,2] 
= 6(2a—1) + 5A(a)[6 max{a,1- a} 


+k—(f0a+[1— f]e[1-a])] 


where A(a)= A(a,1)— ACa,2). It follows 
that (1) is a best response if and only if 
u(a)<0 V a€[0,a*] and ae 
(a*,1]. Since w&(-) is continuous and in- 
creasing in a, (1) is then a best response if 
and only if &#(a*)=0. Next note that there 
exists a* € (0,1) such that /(a*)=0 if and 
only if #(0)<0<wW()). It is straightforward 
to show that W(0) < 0 < W(1) if and only if 


(11) 0/k>« 


= max{5A(0)/[1 — fA(0)], 


This proves Theorem 1. It is important to 
note that it is possible for A(1)>0> A(0), 
in which case « < 0. In that event, 0/k >k 
is automatically satisfied since @>0 and 
k>0. 


Given this equilibrium, the first task is to 
assess the policy bias created by the incum- 
bent’s desire to improve his chances of re- 
election. Toward this end, it will prove to be 
useful to have a measure of the anticipated 
popularity of a policy. Recall that A(a@) is 
the difference between the probabilities of 
reelection from implementing policy | and 
policy 2. When A(a) > 0, the advantage from 
policy 1 could be due to policy 1 being more 
effective (which is believed to be true when 
a >) or due to the fact that it is relatively 
likely that the median voter prefers policy 1 
[roughly, G(-) puts relatively more weight 
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over (5, 1]]. In order to control for the for- 
mer effect, I propose to use A(4) as a mea- 
sure of policy popularity among voters, since 
both policies are believed to be equally ef- 
fective. I say that policy 1 is more popular 
than policy 2 if and only if A(4) > 0. 

Proposition 1 shows that some incumbent 
types do not implement the policy believed 
to be most effective and instead implement 
the policy that is expected to be more popu- 
lar. 


PROPOSITION 1: /f A(5) 20, then a* $3. 


PROOF: 

First note that o(5)=kA(5) so that 
W(4)20 as AG)Z0. If AG)>0, then 
W(4)> 0. Since is increasing in a and 
U(a*)=0, then a* <4. Hence, if A(5)> 0, 
then a* <4. By analogous means, one can 
complete the proof of Proposition 1. 


Suppose A(4)>0 so that prior informa- 
tion suggests that voters are more likely to 
be ex ante in favor of policy 1. There are 
two factors at play which determine the 
incumbent’s optimal policy. Ceteris paribus, 
reelection is more likely when the more 
popular policy is implemented as the me- 
dian voter applies a less stringent standard 
in voting for the incumbent. On the other 
hand, implementing the policy that one be- 
lieves is more effective generates higher ex- 
pected income which the incumbent directly 
values (when @> 0) and which results in a 
higher probability of achieving a given in- 
come criterion for reelection. There is no 
trade-off when the policy that the incum- 
bent believes is more effective is also the 
more popular one. Thus one finds that 
x*(a)=1 for all a €(4,1]. When a is less 
than but close to 5, the incumbent believes 
that both policies generate roughly the same 
expected income. However, since policy 1 is 
more popular, the incumbent’s probability 
of reelection is strictly higher from imple- 
menting it. As a result, these incumbent 
types implement policy 1 even though they 
find it to be less effective than policy 2. 
When a is sufficiently small, the income 
differential between the two policies is suf- 
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ficiently great that the incumbent prefers to 
implement policy 2 despite its anticipated 
unpopularity. This occurs for a < a*. 
Strategic manipulation of policy then results 
in incumbent types over [a*, +) implement- 
ing the more popular policy even though it 
is expected to generate lower income. 

By the preceding analysis, the more popu- 
lar policy will be implemented by all incum- 
bent types who believe it to be the most 
effective policy (and thus will implement it 
again if reelected) but also by some incum- 
bent types who believe it to be the weaker 
policy (and thus will not implement it again 
if reelected). As a result, the more popular 
policy is a noisy signal of the incumbent’s 
future policy intentions. In contrast, the less 
popular policy is a perfect signal of the 
incumbent’s future policy intentions, since 
he implements it only if he believes it is the 
most effective policy. Hence, he is certain to 
implement it again if reelected. 

Though the more popular policy is a noisy 
signal, it is important to emphasize that the 
incumbent’s first-term policy always con- 
tains information as to his second-term pol- 
icy as long as the incumbent is sufficiently 
policy-motivated [i.e., where x is 
defined in (11)]. This restriction on the de- 
gree of policy motivation is trivially satisfied 
when the incumbent’s prior beliefs on the 
median voter’s type, G(-), are symmetric as 
k <0. Since then A(5)=0 (so that both 
policies are equally popular), it follows from 
Proposition 1 that a* =4. In this special 
case, both policies are perfect signals of the 
incumbent’s future policy intentions. By a 
continuity argument, Proposition 2 estab- 
lishes that if neither policy is believed to be 
heavily favored by the electorate then the 
incumbent’s first-term policy will contain in- 
formation about his beliefs and thereby 
about the policy he would implement if 
reelected. Since the proof is straightfor- 
ward, it is omitted. 


PROPOSITION 2: Assume that G(-) re- 
sults in M(4)=0 and {G*(-)} converges 
weakly to G(-). Then lim, a*(G7(-)) = 5. 


It follows from Propositions 1 and 2 that 
if policies are not too disparate in terms of 
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their anticipated popularity, an incumbent’s 
first-term policy always contains information 
as to his policy if reelected. This result 
holds even if 6=0 so that politicians are 
purely office-motivated. In contrast to pre- 
ceding analyses which investigate the infor- 
mativeness of an incumbent’s policy, infor- 
mation can be conveyed even when the 
incumbent is purely office-motivated.’ Such 
an incumbent will implement the less popu- 
lar policy when he believes that it is suffi- 
ciently effective, because exceptional per- 
formance can induce voters to change their 
policy preferences. This result highlights the 
role of voters’ policy preferences being en- 
dogenous. If instead voters’ policy prefer- 
ences are fixed, then a purely office-moti- 
vated incumbent would always choose the 
more popular policy so that his policy choice 
would contain no information as to his fu- 
ture policy intentions. 

While I have shown that there exists an 
equilibrium in which the incumbent’s policy 
contains information about his future policy 
intentions, there do exist other perfect 
Bayesian equilibria. If the incumbent is 
sufficiently office-motivated, there exist both 
pooling equilibria—so that the incumbent 
implements the same policy regardless of 
his type—and nonpooling equilibria in 
which an incumbent implements a policy if 
and only if he believes it to be sufficiently 
ineffective (contrary to the equilibrium of 
Theorem 1).8 However, Theorem 2 shows 
that if the incumbent is sufficiently policy- 
motivated then (1) and (6) form the unique 
perfect Bayesian equilibrium. 


THEOREM 2: (1) and (6) is the unique 
perfect Bayesian equilibrium if and only if 


> max{6/[1—-6(1- f)],8/(1- 8f)}. 


7A possible exception is Austen-Smith and Banks 
(1989). 

nonpooling equilibrium mentioned has x(a) = 
1 V a€[0,a°] and x(a)=2 V for some 
a° €(0,1). Since all actions are implemented in equi- 
librium with positive probability, this equilibrium will 
survive any known refinement. One can also show that 
pooling equilibria survive the intuitive criterion (In-Koo 
Cho and David M. Kreps, 1987) when candidates are 
sufficiently office-motivated; that is, @ is sufficiently 
close to 0. 
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(The proof is available from the author upon 
request.) 


IV. Comparative Statics 


In this section, I perform comparative- 
static exercises with respect to the equilib- 
rium defined in (1) and (6). To ensure exis- 
tence, it is assumed that @/k is sufficiently 
large according to Theorem 1. All proofs of 
comparative statics are in the Appendix. 


A. The Effect of Policy Popularity 


In order to improve their chances of re- 
election, some incumbent types will imple- 
ment the policy that is likely to be favored 
by the median voter even though it does not 
maximize expected income. Proposition 3 
shows that this effect becomes more pro- 
nounced the more popular a policy is ex- 
pected to be. Note that if G°(-) stochasti- 
cally dominates G"’(-) in the first degree 
then there is a higher probability that the 
median voter prefers policy 1 and, by 
Proposition 3, a higher probability that the 
incumbent implements policy 1. 


PROPOSITION 3: If G"(-) stochastically 
dominates G""(-) in the first degree, then 
a*(G"(+))< a*(G"(-)). 


This theory then predicts that a policy is 
more likely to be implemented, the stronger 
is support for it as revealed through pre- 
election polls. While this result is unsurpris- 
ing, | am unaware of it having been formally 
derived as an implication of preceding mod- 
els. 


B. The Effect of Policy Motivation 


A critical property of equilibrium is that 
some incumbent types hide their true policy 
intentions by implementing the more popu- 
lar policy. This has the implication of mak- 
ing the more popular policy an imperfect 
signal of the policy the incumbent will im- 
plement if reelected. A measure of this 
ambiguity is the “number” of incumbent 
types who strategically manipulate policy: 
|a* —+|. This is a meaningful measure be- 


cause a marginal rise in |a* — 5| results in a 
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lower probability that the incumbent will 
implement the same policy if reelected. 
Since a voter votes for the incumbent in the 
hope that he will implement the same policy 
in his second term, this lower probability is 
detrimental to voters. The ensuing analysis 
assesses how changes in parameters influ- 
ence the degree of ambiguity. 

As the incumbent becomes more policy- 
motivated, he cares more about the income 
generated during his time in office. As a 
result, an incumbent will be less inclined to 
implement the more popular policy when he 
expects it to perform relatively poorly. Since 
then more incumbent types implement the 
policy they believe is more effective, an in- 
cumbent’s policy is a better signal of his 
future policy intentions, as stated in Propo- 
sition 4.” 


PROPOSITION 4: If a* 43, 
dla* —1| 7/00 <0. 


then 


C. The Effect of Voter Uncertainty 


A singular feature of the present model is 
that voters are uncertain about which policy 
is best. This has played a central role in my 
analysis as it results in voters’ policy prefer- 
ences being endogenous to performance. It 
is then of particular interest to consider 
how equilibrium behavior responds to a 
change in voter uncertainty. 

When voters become less confident in 
their prior beliefs as to the efficacy of dif- 
ferent policies (as represented by a rise in 
o*), they assign more weight to observed 
performance in updating these beliefs [see 
(2) and (3)]. As a result, their voting behav- 
ior becomes less dependent on their initial 
opinions and more sensitive to perfor- 
mance. This is reflected in the difference 
in the reelection criteria shrinking: |y (u)— 
y(w)| is decreas- 
i The effect of this change in 


| 
ing in 
voters’ standards on the incentives of the 
incumbent is as follows. Suppose a* <4 so 
that some incumbent types implement pol- 


All of the ensuing comparative statics are derived 
only when a* # , because w/(-) is not differentiable at 
@ Ss. 
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icy | even though they think it will produce 
less income. Recall that this is optimal be- 
cause the reelection criterion depends on 
both which policy is chosen and the prior 
beliefs of the median voter. If it is suffi- 
ciently likely that the median voter is pre- 
disposed to policy 1, then an incumbent 
might choose policy 1 even if he believes it 
will produce lower income on average. As I 
argued above, the difference in reelection 
criteria shrinks as voters become more un- 
certain, so that the probability of reelection 
is less dependent on the identity of the 
median voter and more dependent on how 
well the policy performs. An incumbent is 
then less inclined to implement a policy 
simply because it is likely to be initially well 
received and more inclined to implement 
the policy that maximizes income. As a re- 
sult, there are fewer incumbent types who 
strategically implement the more popular 
policy so that |a* —4| shrinks. Since it is 
more likely that the policy an incumbent 
implemented is the one he would imple- 
ment if reelected, an incumbent’s policy be- 
comes a more informative signal of his 
second-term policy. 

While we believe that this result is gen- 
eral, it has only been proved for a class of 
distributions on the median voter’s type. 
This class is defined below: 


(12) 0={G(-):[0,1]— [0,1]| 
e (0, 5] or 
G'(w) <G'(1-p) We 5]}. 


PROPOSITION 5: If a* #4 and G(-)EO 
then d\a* — | /da <0. 


Q encompasses many distributions, includ- 
ing all unimodal distributions for which the 
density function is symmetric around its 
maximum and all distributions for which the 
density function is monotonic (though the 
latter is not a particularly likely set of be- 
liefs on the median voter’s type). Note that 
the uniform distribution, which is assumed 


af 
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by Alesina and Cukierman (1990), lies in 2. 
Finally, it is important to emphasize that 
the requirement that G(-) lie in is a 
sufficient condition. For example, one can 
show that d|a* —$|/oa* <0 when G(-) lies 
in an appropriately defined neighborhood 
of the set 0.'° 


V. Empirical Implications 


As reviewed at the beginning of the pa- 
per, previous theories on the economic de- 
terminants of electoral outcomes stress the 
dominant role of an incumbent’s policies in 
determining the success of his bid for re- 
election. Performance plays a role only as a 
noisy signal of unobservable policies. In 
contrast, the theory developed here predicts 
that measures of performance are influen- 
tial in elections even when policies are ob- 
served by voters. It is predicted that the 
probability of reelection is increasing in per- 
formance (e.g., per capita real GNP) and 
increasing in the popularity of a policy 
(measured prior to its implementation). 

There are several ways to test these com- 
peting theories. One way is to estimate the 
probability of reelection as a function of 
measures of performance and a _policy- 
popularity index. This index could be a 
weighted average of approval ratings for 
particular policies (derived from public 
opinion polls). If it is not possible to con- 
struct such an index, one could substitute 
the policy instruments themselves. In con- 
trast to preceding theories, my theory pre- 
dicts that measures of performance will be 
significantly correlated with electoral out- 
comes even when the incumbent’s policies 
are controlled for in the empirical specifi- 
cation.!! 


has also been proved that <0 
when o? is sufficiently small without any restrictions 
on G(-). 


"Realistically, this will be a difficult hypothesis to 
test, due to omitted-variables bias. Some relevant pol- 
icy instruments will inevitably be left out of the empir- 
ical specification. If the included measures of per- 
formance are affected by these omitted policy 
instruments, one could get a significant coefficient on 
performance even if the theory is false. 
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To the best of my knowledge, there has 
been no empirical work that simultaneously 
controls for both policy and performance in 
assessing the determinants of electoral out- 
comes. However, there is considerable em- 
pirical evidence to substantiate the claim 
that the performance of the economy is 
correlated with voting (see the reviews in 
Kirsten Renwick Monroe [1984] and Kiewiet 
and Rivers [1985]). There has been consid- 
erably less empirical work that explores the 
impact of policies. However, what work 
there is finds that policies are influential. 
Henry W. Chappell, Jr., and William R. 
Keech (1985) show that monetary policy in- 
fluences presidential popularity, while 
Michael S. Lewis-Beck (1988) provides evi- 
dence from public opinion surveys that vot- 
ers appear to take account of policies in 
deciding how to vote. 

While one could test the present theory 
using presidential elections, an alternative 
data set could be compiled from mayoral 
elections. The president’s control over pol- 
icy instruments can be problematic when he 
faces a nonsupportive Congress, while the 
presence of other players in the policy arena 
is less of a concern for a city mayor. In this 
setting, voter uncertainty can be interpreted 
as uncertainty as to how taxes and expendi- 
tures translate into municipal services. For 
example, voters may be uncertain (and dif- 
fer in their beliefs) as to the effect of a 
change in police expenditure on the crime 
rate or the value of an increase in municipal 
services resulting from a rise in property 
taxes. Policy instruments could include 
property tax rates and expenditures on po- 
lice and education. The associated mea- 
sures of performance could include prop- 
erty values (a measure of the value of 
services to property owners), the crime rate, 
and the high-school graduation rate. The 
plan would be to estimate the probability 
that a mayor is reelected as a function of 
these policy instruments and measures of 
performance. One would also need to con- 
trol for observable variables outside of the 
control of the mayor that would affect these 
measures of performance. For example, the 
general state of the economy affects the 
crime rate and housing prices. 
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In addition to making predictions about 
the determinants of electoral outcomes, my 
theory also predicts the determinants of an 
elected official’s policies. As shown in the 
preceding section, a policy is more likely to 
be chosen, the more popular it is. Further- 
more, the policy that is more popular is less 
likely to be chosen, the more uncertain are 
voters. In translating these comparative stat- 
ics into testable hypotheses, define p as the 
proportion of people polled in favor of pol- 
icy 1. Assuming that p is a noisy estimate of 
the median voter, let G(- |p) be the incum- 
bent’s beliefs on the median voter’s type 
conditional on the public opinion poll. If 
one supposes that p’>p” implies that 
G(- |p’) stochastically dominates G(- |p") in 
the first degree, it follows from Proposition 
3 that the probability that a policy is imple- 
mented is increasing in the prior approval 
rating for that policy.'? Benjamin I. Page 
and Robert Y. Shapiro (1983) provide evi- 
dence supportive of this hypothesis in that 
they find that changes in public opinion 
cause changes in policies. Proposition 5 
states that the more popular policy (say, the 
policy with p > 4+) is less likely to be imple- 
mented the greater is voter uncertainty. If 
one assumes voter uncertainty over which 
policy is best to be greater for new policy 
issues, then the theory would predict that 
the probability that a policy is implemented 
is greater for a new issue when p< : and 
lower when p> 5. 


VI. Concluding Remarks 


An essential feature of the electoral pro- 
cess is that voters face considerable uncer- 
tainty when deciding for whom to vote. Due 
to the central role of voter uncertainty, there 
has been a growing amount of research 
which explores the incentives of elected of- 


"Selection bias could be a problem here. If it is 
known that voters are predisposed to a particular pol- 
icy, then a candidate whose track record tends to 
support that policy is more likely to be reelected. 
Assuming voters’ preferences are correlated over time, 
one could find a correlation between current support 
for a policy and the probability that an elected official 
would implement it even if elected officials always 
implement the policy they believe is best. 
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ficials to exploit this uncertainty in order to 
improve their chances of reelection. Gener- 
ally, voters have been modeled as being 
uncertain of politicians’ preferences or their 
levels of competency. A new source of voter 
uncertainty was introduced in this paper. 
Voters were assumed to be uncertain as to 
the efficacy of different policies. By being 
able to learn over time, voters’ policy pref- 
erences were found to be endogenous to the 
level of income. Naturally, voters (as well as 
politicians) were allowed to have different 
opinions as to the best policy. 

My analysis generated several new find- 
ings. In contrast to preceding work, I found 
that electoral outcomes are influenced by 
both an incumbent politician’s past policies 
and the level of economic activity during his 
time in office. Performance has a bigger 
impact on the electoral outcome, the more 
uncertain are voters as to which policy is 
more productive. Whether policy or perfor- 
mance tends to be the dominant factor in 
elections was shown to have important im- 
plications for the behavior of elected offi- 
cials. The less sensitive are electoral out- 
comes to performance relative to policy, the 
greater is the manipulation of policy for the 
purpose of reelection. Specifically, an in- 
cumbent is more likely to implement the 
policy that he believes will be well received 
rather than the policy that he believes will 
maximize expected income. This causes an 
incumbent’s past policies to be a less infor- 
mative signal of his future policy intentions. 
By manipulating policy in this manner, an 
incumbent improves his chances of reelec- 
tion and thereby affects future policy as well 
as current policy. 


APPENDIX 


PROOF OF PROPOSITION 3: 

G" stochastically dominates G°° in the 
first degree if G'(w)<G(w) V w [0,1] 
and for some p 1]. 
First note that G enters E[U|a,1] only 
through the probability of reelection [see 
(9)|. Since the integrand in (8) is strictly 
increasing in uw then A(a, 1) is strictly higher 
under G° than under G"’. Hence, E[U|a, 1] 
is strictly higher under G° than under G’. 
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Turning to A(a,2) in (10), the integrand is 
strictly decreasing in uw so that A(a,2) is 
strictly lower under G° than under G™’. 
Hence, E[U|a,2] is strictly lower under G° 
than under G”’. 

It follows that Va 
€[0,1]. Since a* is defined by wla*; 
G)=0 then Wla*(G™); 
G”) = 0. Given that /(-) is increasing in a, 
it is true that a*(G°) < a*(G"). 


PROOF OF PROPOSITION 4: 

Recall that a* is defined by W(a*)= 0. 
Given that du(-)/00#0 and w(-) is dif- 
ferentiable in 6 and a V a€(0,4)UG,)), 
then 


da* —[as(a*(0),0) 
a0 [dW(a*(0),0) /da] 


(Al) 


Since df(-)/da>0 V a€(0,4)UG, 1), it 
follows that 


(A2) sign{da* 


= sign{ — d(a*(@),0@) /00} 
= sign{2[ a* —3] 
+ 6A(a*)[max{a*,1—a*} 


~ fa—(1- f)(1-a*)}}. 


When a* €(0, 5), the right-hand side of 
(A2) is sign{2(a* — 4)[1— 5fA(a*)]}, which 
is negative as 6<1, f<1, and A(a*)<1. 
Given that a* <5 and da* /d0 <0, it fol- 
lows that —4|/00<0. When instead 
a* €(5,1), then the right-hand side of (A2) 
is sign{2(a* — 5)[1+ 6(1 — f )A(a*)]}, which 
is positive. Hence, d|a* — +| /00 <0. 
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PROOF OF PROPOSITION 5: 
Since is defined by W(a*)=0, then 


da* —[(db(a*,p)/dp)(dp /da*)| 
do? 


(A3) 


Given that dp /da* > 0, then 
sign{da* /do*} = sign{ — d(a*,p) /dp}. 


Since p enters W(-) only through A(-) and 
given w(-) is increasing in A, then 


sign{ — du(a*, p)/op} 
= sign{ — dA(a*,p)/dp}. 


show 
then 


To prove Proposition 5, I will 
that when a* #5 and G(-)EQ 
dA(a*(p), p)/dp 2 0 as a* 25. 

Suppose V 4] 
and G'(1— > for some €[0,4). I 
begin by proving that this implies a* <>}. 
Define G(-) by and 
7(1—-p)=G'U— p)— where B(u)= 
— w)- €[0, 5). Since 
G'(u)=G'— V we [0,4] then G(-) is 
symmetric around + which implies that 
a*(G)=4. Given that as 
uw Sh, then the single-crossing property is 
satisfied, which means that G(-) stochasti- 
cally dominates G(-) in the first-degree. 
From Proposition 3, it follows that a*(G) < 
a*(G) or <4. 


Given that a* <+ then dja* —5|/da? < 
0 if and only if dA(a*,p)/dp <0. It is 
straightforward to derive equation (A4), be- 
low. Since 1-24 >0 as w <4, then a suf- 
ficient condition for (A4) to be negative is 


( A4) dA(a*,p)/dp 


1 
= (2/07) -(2— H'((1/p][1 —(2— p)U — a* — dp 


= 
3 
| 
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that the integrand is negative V » €[0, 5). 
One concludes that 
> — a*)G'(n) 
Vu e[0,+) 
then dA(a*,p)/dp <0. 
It is straightforward to show that 
> a*) 


It follows that a sufficient condition for (A5) 
to hold is that — > G'(u) VW €[0, 5] 
which is true by supposition. I have then 
shown that if a* <4 then dA(a*, p)/dp < 0. 
This proves that d|ja* —4|/daa? <0. 

When VW w [0,4] and 
G'(u)> for some €[0,4) then 


a* >4, and the proof is analogous. When 
G'(u)=G'—p) V we [0,4] then a* 
which violates the condition of Proposition 
5 that a* #4. 
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The Macroeconomics of Dr. Strangelove 


By A. ANDREW JOHN, ROWENA A. PECCHENINO, AND STACEY L. SCHREFT* 


This paper examines the weapons-accumulation decisions of two adversarial 
countries in the context of a deterrence / conflict initiation game embedded in an 
overlapping-generations model. The demographic structure permits analysis of 
both within- and between-country intergenerational externalities caused by past 
weapons-accumulation decisions, as well as of intragenerational externalities 
from the adversary’s current weapons accumulation. Zero accumulation is a 
possible equilibrium with both noncooperative and cooperative behavior. Coun- 
tries may also accumulate weapons to the point where conflict initiation never 
occurs. Pareto-improving policies are generally available, but international coop- 
eration need not be Pareto-improving. (JEL 62, D74, E21) 


The Cold War has ended, and President 
George Bush has declared the beginning of 
a “new world order.” This new world order 
has so far been characterized by the intensi- 
fication of regional disputes, civil strife, and 
civil war; its implications for arms prolifera- 
tion and arms control are still unclear. This 
paper considers both superpower conflict 
and the transition to international coopera- 
tion in the context of a model that empha- 
sizes intergenerational linkages. Specifically, 
we explore the weapons-accumulation deci- 
sions of two adversarial countries in the 
context of a deterrence /conflict initiation 
game embedded in an overlapping-genera- 
tions model. The demographic structure of 
the overlapping-generations model allows 
the analysis of both between- and within- 
country intergenerational external effects 
caused by past weapons-accumulation deci- 
sions, while the two-country setting permits 
consideration of international external ef- 
fects caused by the adversary country’s cur- 


*John and Pecchenino: Department of Economics, 
Michigan State University, East Lansing, MI 48824. 
Schreft: Federal Reserve Bank of Richmond, Rich- 
mond, VA 23261. We thank David Schimmelpfennig 
for stimulating our thinking about intergenerational 
externalities, and Carl Davidson, John Goddeeris, 
Marvin Goodfriend, Daniel Hamermesh, Neil Wallace, 
and three anonymous referees for their helpful com- 
ments. The views expressed in this paper are solely 
those of the authors and do not reflect the views of the 
Federal Reserve Bank of Richmond or the Federal 
Reserve System. We take responsibility for all errors. 


rent weapons-accumulation decision. We 
analyze the noncooperative equilibrium and 
then consider the effects of a regime change 
characterized by international cooperation, 
but where the intergenerational externali- 
ties remain. 

We emphasize intergenerational external- 
ities in our model because they are intrinsi- 
cally hard to internalize: those who bear 
their cost or receive their benefit are with- 
out representatives in the present genera- 
tion. Thus their internalization requires 
intervention by long-lived government insti- 
tutions. We also analyze the problem of 
such institutions. 

The analysis draws on a number of di- 
verse economic literatures. The model uti- 
lizes the overlapping-generations frame- 
work of Maurice Allais (1947) and Paul A. 
Samuelson (1958), since this demographic 
structure lends itself to analysis of situations 
where agents’ actions have consequences 
beyond their own lifetimes. Positive inter- 
generational external effects play a role in 
models of endogenous growth (see e.g., Paul 
M. Romer, 1986; Nancy L. Stokey, 1988), 
while intergenerational diseconomies are 
the focus of Todd Sandler’s (1982) work on 
intergenerational club goods. In this paper, 
as in work on environmental externalities 
(John et al., 1992), both positive and nega- 
tive intergenerational effects interact. Fi- 
nally, the model draws upon the game- 
theoretic arms-control literature (see e.g., 
Michael D. Intriligator and Dagobert L. 
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Brito, 1984; Steven J. Brams and D. Marc 
Kilgour, 1985; Frederick van der Ploeg and 
Aart J. de Zeeuw, 1990). 

The main findings of the model are that 
equilibria in which neither country accumu- 
lates weapons are possible outcomes of both 
noncooperative and cooperative versions of 
the game and that countries may choose to 
accumulate weapons to the point where 
conflict initiation is so dangerous that it 
never occurs. The dynamic adjustment of 
the model to a steady state can be inter- 
preted as an arms race: successive genera- 
tions engage in individually rational but so- 
cially suboptimal arms accumulation. For 
second-best reasons, neither within-country 
intergenerational cooperation nor interna- 
tional cooperation necessarily leads to 
smaller weapons stocks or to Pareto im- 
provements. 


I. The Environment 
A. Preliminaries 


Consider an infinite-horizon world with 
two identical countries (called A and B). In 
each country, a new generation (called gen- 
reration t) of agents is born at each date 
t=1,2,.... There is no population growth, 
‘and each generation is normalized to be of 
size 1. Each member of generation f¢ lives 
for one period with certainty, and for two 
periods with probability ¢(-). Further, each 
has preferences defined over consumption 
when young, c(t), and when old, c°(t +1). 
These preferences are represented by a von 
Neumann-Morgenstern utility function: 


(1) U(e%(t)) + O(-)V(e%(t +1)) 


where (-) is derived below. Assume that 
U(-) and V(-) are both increasing, twice 
continuously differentiable, and strictly con- 
cave. Also assume that 


lim U'(-) 

lim V'(-) 

V(0) >0. 
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Each member of each generation ¢>1 in 
each country is endowed with e > 0 units of 
the economy’s single good when young and 
nothing when old. At date t = 1 each coun- 
try collectively possesses a stock of W(1) > 0 
weapons. 

Any portion of the date-t endowment can 
be consumed at date f, yielding utility at f, 
or used to provide for future utility. To 
accomplish the latter, the consumption good 
can either be saved or used to add to the 
home economy’s weapon stock. Each young 
agent has access to a storage technology, 
which is the sole vehicle for saving. Storage 
of s(t) units of the good at date ¢ yields 
Rs(t) units of the good at date t + 1, where 
R> 0 is the gross technological rate of re- 
turn. Alternatively, units of the good can be 
converted into an equivalent number of units 
of weapons, w(t). Accumulation of weapons 
at ¢ adds to the stock of weapons that will 
be carned into date t+1,W(t+1). The 
stock of weapons evolves according to 


(2) W(t+1)=(1—5)W(t)+ w(t) 


given the depreciation rate 6 €(0,1] and 
W(1). Weapons accumulation affects utility 
by altering the probability of survival, #(-), 
as discussed in the next subsection. 

Since a primary focus of this paper is on 
external effects across generations, we ab- 
stract from the well-understood free-rider 
problems within a country and within a gen- 
eration. We assume that the agents alive in 
each country at the beginning of each date 
elect a government for a one-period term. 
This government behaves myopically, carry- 
ing out policies made solely for the welfare 
of agents alive during its term in office. The 
government has the power to levy lump-sum 
taxes and transfers. Specifically, it levies 
lump-sum taxes on the young to achieve the 
desired stock of weapons. That is, an agent’s 
choice of weapons can be interpreted as 
arising from the collective provision of na- 
tional defense, a public good. This alloca- 
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tion could be achieved as a Lindahl equilib- 
rium.! 


B. The Game Between Countries 


Each generation of young agents plays 
a one-shot game against its counterparts 
in the other country. At the start of the 
period, young agents choose the amount of 
their endowment that they wish to convert 
to weapons, taking as given the equivalent 
choice in the other country and the inher- 
ited stocks of weapons in both countries. 
From equation (2), these decisions give rise 
to the current weapons stocks (W“,W®) in 
the two countries. (For ease of presentation, 
we suppress the time index in this subsec- 
tion.) 

At the end of the period there is a 
“potential for conflict.”? Specifically, there 
is a realization of a random variable that 
determines the probability that one side or 
the other will win, should either side decide 
to initiate a conflict.* A conflict has three 
characteristics. First, it has a winner and a 
loser; that is, there is a utility gain from 
winning and loss from losing. Second, we 
assume for simplicity that a conflict does 
not deplete either country’s stock of 
weapons. Third, a conflict can escalate into 
a full-scale catastrophic war.* 


'Note that while young agents would like to levy 
lump-sum taxes on the old, such a policy would not be 
achievable as a Lindahl equilibrium. We are grateful to 
a referee for clarifying the role of the government in 
this model. 

“The term “conflict” should be interpreted liberally; 
we think of it as any situation in which the two coun- 
tries engage in contentious behavior, ranging perhaps 
from a verbal dispute at the United Nations or an 
expulsion of diplomats, to a confrontational stand-off 
like the Cuban missile crisis or the Berlin blockade, or 
even a limited war like Vietnam or Afghanistan. 

“We assume that this probability is independent of 
which side initiates; that is, there is no first-mover 
advantage. In the problem as set out, the sides will 
never choose to initiate simultaneously. 

*These last two assumptions together suggest that if 
a conflict escalates, weapons stocks and all else are 
annihilated, while in a limited conflict the weapons 
destroyed are those that would otherwise have depreci- 
ated, so the stock bequeathed to the next generation is 
not affected. 
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Each side decides whether or not to initi- 
ate conflict. The gain from winning and loss 
from losing are proportional to utility when 
old and are the same no matter which side 
initiates. If the conflict does escalate, then 
there is mutual annihilation, giving zero 
utility in old age. The risk of this outcome 
generates the cost of initiating a conflict. 

If either side initiates conflict, the proba- 
bility of annihilation is g = q(W“,W®), with 
> 9, <9, a(x, x)= G(x, x) V 


x and g(0,0)=0. The probability that coun- 


try A wins a conflict is 


p=n+ f(W,w®) 


~ U[0, 1] 


where 7 is a random variable determining 
the potential for conflict, and f, > 0, f, <0, 
fi.<0, flx,y)=—-fly,x) VW x,y, and 


f(x,x)=0 V xe The probability that coun- 


try B wins is 1— p. The q(-) function cap- 
tures the idea that increased weapons stocks 
in either country make any conflict poten- 
tially more dangerous. The f(-) function 
captures the idea that a country’s likelihood 
of winning a conflict is higher, the more 
weapons it possesses and the fewer weapons 
possessed by its adversary. Accumulation 
that increases one country’s probability of 
winning a conflict decreases the probability 
that the other wins, and each country’s 
probability of winning equals + when the 
countries possess equal weapons stocks. We 
assume that f(-) and q(-) are continuous 
and twice-differentiable. Provided that anni- 
hilation does not occur, the agents in the 
victorious country receive utility when they 
are old equal to V(- (1+ @), and the agents 
on the losing side have utility when they are 
old equal to V(-(1— a@), where a is a posi- 
tive parameter.° 


\ssume that if p as defined exceeds 1, it takes the 
value |, and if p as defined is less than 0, it takes the 
value 0. Since we restrict attention to symmetric equi- 
libria, these restrictions are never relevant for our 
analysis 

The augmentation (diminution) of utility as a result 
of winning (losing) a conflict is analogous to the pay- 
ment of tribute by the losing side to the winning side. 
While Michelle R. Garfinkel (1990) makes the transfer 
of goods explicit, we model the tribute as a psychologi- 
cal gain (loss). 
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Now consider country A’s decision of 
whether or not to initiate conflict. If B initi- 
ates, A’s decision is irrelevant. Therefore 
we need only consider the case in which B 
does not initiate. Country A’s expected util- 
ity when old if it initiates is 


(1-q)p(it+a)V(-) 
+(1-—q)(1- p)(1-a)V(-) 


and it is V(-) if country A does not initiate. 
Thus, country A will choose to initiate if 
and only if 


p>[1+2(4)]/2 


where z(q)=q/[a(1—q)]. It is indifferent 
between initiating and not initiating if 


n= — f(-)+[1+ z(4)]/2. 


Similarly, country B is indifferent between 
initiating and not initiating if 


p=[1+2z(q)]/2 


=1- f(-)—[1+z(4)]/2. 


The probability that neither side initiates is 
then the probability that 7 lies between 
these two values, which simply equals z(q). 
Since z(q) is increasing in q and q(-) is 
increasing in both its arguments, it follows 
that greater weapons accumulation by ei- 
ther side makes conflict less likely to occur, 
but more likely to be catastrophic if it does 
occur.’ 

Now consider the first-period weapons 
choice. Country A chooses weapons (w) and 
saving (s) to maximize 


U(c’) + 6(W4,W®)V(c?) 


(subject to the budget constraints discussed 


7For an interior solution to the problem set out 
here, z(q) must be iess than 1 [= q <a /(1+a)]. For 
the present, we assume that this condition holds and 
discuss its implications later in the paper. 
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below), where 


o(W*,W®) 
= z(q)+[1— z(q)]— 9) 
X{E(p)(1+ a)+[1— 
= 2z(q)+(1— pq)[1+2af(-)] 
p=(1+a)/a. 


(Country B solves an analogous problem.) 
The first term [z(q)] is the probability that 
neither country initiates. The second term 
is the probability that someone initiates 
(1—z), multiplied by the probability that 
the world nevertheless survives (1 — q), mul- 
tiplied by the expected gain from conflict, 
which in turn depends upon the expected 
probability of victory, E(p)=4+ f(-). Ina 
symmetric equilibrium, f(-)=0, so the ex- 
pected gain from conflict is zero; in this 
case $(-) can be interpreted as the survival 
probability. Note that, since f(-) and q(-) 
are continuous and differentiable, so too is 
o(-). The behavior of the economy depends 
upon the properties of #(-), which are dis- 
cussed in detail in Section III. 


C. The Agent’s Optimization Problem 


The representative agent in country A 
(and symmetrically for the representative 
agent in country B) takes as given the re- 
turn on storage, R, the stocks of weapons at 
the beginning of period t, W4(t—1) and 
W ®(t —1), and the weapons choice of coun- 
try B, w3(t). The agent chooses c(t), 
c4°(t +1), w(t), and s4(t) to maximize 


(3) U(e%(t)) 

+ (WA(t), )V(c%(t +1) 
subject to 
(4) c%(t)+w(t)+s4(t) <e 


(5) c@(t+1)<s(t)R 


(6) WA(t)=(1-8)W4(t-1)+ w(t) 


and c4¥(t),c4(t + 1),w(t), s(t) >0. 


3 
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Utility maximization yields the following 
first-order conditions for an interior opti- 
mum: 


(7) 
(8) —U'(-)+6,(-)V(-) =0. 


The restrictions on the utility function en- 
sure that (7) is always satisfied. A necessary 
condition for (8) is that @, be positive; that 
is, an increase in w’ benefits country A. 
The assumptions of the model so far are not 
sufficient to ensure that @, is always posi- 
tive. If @, is negative, agents will not invest 
in weapons, and as shown in Section III, an 
equilibrium with zero investment in weapons 
is possible. Henceforth, we assume that ¢, 
is positive ualess otherwise noted.* 

The individual equates the marginal util- 
ity of consumption in youth with the ex- 
pected marginal utility of consumption in 
old age [equation (7)]; dR can be thought of 
as an implicit interest rate. Equations (7) 
and (8) together reveal that the individual 
also equates the return on saving to the 
return On weapons accumulation: 


bV=oRV'. 


The second-order conditions from indi- 
vidual maximization require that 


U" + dV"R? 
U"+oV'R 


U"+oV'R 
U"+4,V 


A=det 


= U"(dV"R? 6,V'R) 


+ ,,V(U" + dV"R’) 


” 


— 6V'RU' >0. 


Sufficient for this is that U” /U'+ 6, /6<0 
and <0. 


8 
We assume throughout that destruction of weapons 
in excess of natural depreciation is not possible. 
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II. The Steady State 


A symmetric steady-state Nash equilib- 
rium is given by the quintuple for country 
A, {c’,c°,w,W, s}, and the identical quintu- 
ple for country B,{c’,c°,w,W, s}, such that 
the following statements hold. 


(i) Agents maximize (3) subject to (4)-(6), 
given R and W. 
(ii) The goods market clears: 


(9) [cY+wt+s]+c°=e+sR. 
(iii) The stock of weapons is constant: 
(10) bW=w. 


The equilibrium can be characterized by 
the two first-order conditions, (7) and (8), 
and constant weapons stocks, (10). Total 
differentiation of the first-order conditions 
evaluated at an interior steady state yields 
the following system: 


i+ ds 
(11) ra 


—aw/d 
—bw/d 


dR 
b,/¢ de 
Pia / 1 || 4d 
da 


v=—V"R/V>0 
6=¢,/¢>0 
b= (61, + 20 
b, =9b /da 

= 46, /da. 


For stability the determinant of the left- 


47 
(1 su)/R 
where 
— 
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hand-side matrix, A, must be positive. This 
implies that 


A sufficient condition for A>0 is a(u—£) 
>0O and b<0. 


III. Equilibria 
A. Properties of 6(-) 


We begin this section by discussing the 
properties of #(-), since these are central 
to a characterization of the model’s equilib- 
ria. Recall that #(-) is the weight attached 
to old agents’ utility and depends upon both 
the probability of survival to old age and the 
ga’ or loss from any conflict that might 
occur: 


o(W*,W®) = 2(q)+(1— pq)(1+2af). 


Recall also that z(q) = q /[a( — q)] so that 
z’>0, z” >0. We restrict our attention to 
symmetric equilibria, in which q,=4q>, f,= 
— f,, and f =0. Then, 


(12) 
2apfa, 

=(z'—p)q,+2a(1- pa) 

(13) pq)f, 
—2apfa, 


dd(-) 


The first term in equation (12) captures 
two effects. A higher weapons stock in 
country A increases the probability of catas- 
trophe if either side initiates (— pq,) but 
also therefore increases the risk associated 
with initiating, making initiation less likely 
to occur (z’q,). The second term in equa- 


tion (12) captures the fact that a higher 
weapons stock benefits country A in the 
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FiGureE 1. THE PROBABILITY OF SURVIVAL FOR 
SYMMETRIC WEAPONS STOCKS 


event of conflict; thus, it is positive. Consid- 
ering equation (13), note that the first term 
is identical to the first term of equation (12) 
and reflects the same intuition. A higher 
weapons stock in country B is costly to 
country A, however; hence the second term 
in equation (13) is negative (since f, < 0). 

The behavior of ¢(-), evaluated at a 
steady state (with w = dW), is illustrated in 
Figure 1 under the assumption of symmetry 
across countries: w*=w?=w. At w=0, 
(0,0) =1 because g(0,0)=0, z(0) = 0, and 
f(0,0) = 0. That is, when neither country has 
any weapons, conflict is certain but costless 
because annihilation is impossible. Thus, the 
probability of survival is unity. As w_ in- 
creases, the danger inherent in any conflict 
increases, but the likelihood of conflict falls. 
The first effect dominates for low weapons 
stocks; the second effect dominates at high 
weapons stocks. The probability of survival 
is also unity if agents never initiate conflict 
because the associated risk of annihilation 
is so great. Specifically, let W be defined by 
q(w /5,W/5)=1/p. Then z(q)=1 and 
/5,W /5)= 1. This possibility resembles 
the doctrine of “mutually assured destruc- 
tion” under which deterrence works per- 
fectly because initiation of a conflict carries 
too great a risk of annihilation. 


1 
| 
w W 
| 
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FiGurE 2. DERIVATIVES OF THE SURVIVAL 
PROBABILITY FUNCTION 


Figure 2 shows the partial derivatives of 
o(-) [for arbitrary assumptions on the sec- 
ond derivatives of (-)]. It is easily con- 
firmed that at w = 0, z’(-)— p= —1, imply- 
ing that $,(-)S0, 6,(-)<90, and 
o(-)<0. Figure 2 is drawn such that 
o(-)<0 at w=0. At z(-)—p=1 
+ a> 0, implying that #,(-) and ¢,(-) are 
positive and equal. From our assumptions 
on f(-) and q(-),¢,(-) lies strictly above 
for w <wW. We thus divide the range 
of w into four regions. In the region 0 < w 
<w,, both ¢, and ¢, are negative. (Since 
¢, may be positive for all w, this region may 
be empty.) For w,<w<w, 4, is positive, 
is negative, and their sum + @,) is 
negative. The function ¢(-) attains its 
minimum at w, where ¢,+¢,=0. In the 
region w<w<wy,, is positive, is 
negative, and their sum is positive. Finally, 
for w,<w<w, both ¢, and @, are posi- 
tive. 
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Figures | and 2 are drawn such that (-) 
is single-troughed and $,(-) and @¢,(-) are 
monotonically increasing in w; this is not 
necessarily implied by our assumptions on 


f(-) and q(-), as can be seen by examining 


equations (15)-(18): 
+ 
(16) =(2'— padfir 
+ —2apfrq, —2apf\q2 


=(2z'— p)qy.+2a(1— paq)f 2 
+ 


— + a(1— pq) 


X (fit + 2"(q,) 
= (2'— p)dx +2a(1— pa) fx 
+ —4apfrq>- 


From equations (15)-(18), 67, and $45 
can all be of either sign, as can the sum 
+ O42). 


B. Steady-State Equilibria 


Our primary concern is with the equilib- 
rium weapons stock and associated survival 
probability. We thus let s(w) be implicitly 
defined by the steady-state version of the 
first-order condition (7): 


-U'(e-—s-—w) 
+ b(w/5,w/5)V'(sR)R=0. 


Increases in w increase the marginal utility 
of first-period consumption; this effect tends 
to increase saving. Below w, increases in w 
decrease #(-), which tends to discourage 
saving; the opposite is true above w.” Hence 
s'(w) is negative below w and is of ambigu- 
ous sign above w.'° 


‘The effects of changes in the survival probability on 
saving in our model are the theoretical counterparts of 
Joel B. Slemrod’s (1986, 1990) finding that an increased 
threat of nuclear war is associated with decreased 
saving. We consider the relationship between our re- 
sults and Slemrod’s further in Section IV. 

‘Specifically, s’(w) =(a—u)/(u+ v) 20. 


= 
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The two first-order conditions can be 
combined to yield 


(19) 


Steady-state equilibria for this economy can 
be represented by graphing the two sides of 
equation (19) as functions of the steady-state 
weapons stock, w. The right-hand side of 
(19) is everywhere positive, and it slopes 
upward if and only if s’(w) < 0. 

Figures 3 and 4 illustrate the cases of a 
single interior equilibrium and multiple in- 
terior equilibria, respectively. For purposes 
of exposition, we make the simplifying as- 
sumptions that $,(-)/@ and ¢,(-)/@ are 
increasing in w, and for diagramatic simplic- 
ity, we draw them as linear. While a less 
well-behaved function might divide the 
range of w into more than four regions, 
each region would still have the characteris- 
tics of one of the four described above. The 
equilibrium in Figure 3 is stable, as are the 
two outer equilibria in Figure 4.!! 

Since the right-hand side of (19) is every- 
where positive, an interior equilibrium is 
not possible in the region 0<w<w,. A 
corner solution with zero weapons accumu- 
lation is possible, however, if V’R/V lies 
above 6, /¢ at w=0 (Fig. 5), so that the 
nonnegativity constraint on weapons accu- 
mulation is binding. Note that if 6,(0,0) <0, 
then this corner solution must be an equi- 
librium. 


"Two distinct types of multiplicity may arise. First, 
there may be multiple steady states, but the game 
between countries may still have a unique equilibrium 
at any given date. Such multiplicity can be attributed to 
strategic complementarities between past generations’ 
accumulation decisions and the present generations’ 
decisions (@,; + 4,7 > 0): the more weapons that were 
accumulated in the past, the greater the return to 
accumulation in the present. The final steady state 
then depends upon initial weapons stocks. Alterna- 
tively, there may be multiple equilibria in a given time 
period, with the associated possibility of coordination 
failure. For example, each country might devote a lot 
of its endowment to weapons simply because the other 
is doing so, even if there is a better Nash equilibrium 
with less investment in weapons. This possibility arises 
if there are strategic complementarities within the pe- 
riod (¢,, > 0): the more weapons accumulated by one 
country, the higher the return to weapons accumula- 
tion in the other country. See Russell W. Cooper and 
John (1988) for more discussion of strategic comple- 
mentarities and coordination failures. 
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Figure 3. A STEADY-STATE 
EQUILIBRIUM 


V'RIV 


ove 


\ 


FiGure 4. MULTIPLE STEADY-STATE EQUILIBRIA 


Another possible corner solution has 
agents accumulating weapons to such an 
extent that initiating conflict is never bene- 
ficial and so will never occur because the 
risk of annihilation is too great; that is, 
weapons are accumulated up to w. In this 
no-conflict equilibrium, 6,/¢ lies above 
V'R/V at w=W, as is shown in Figure 6. 


| 
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Ficure 5. AN WITHOUT WEAPONS 
ACCUMULATION 


VY 
WwW 


Ficure 6. A No-Conriict EQUILIBRIUM 


Arms races, such as have been observed 
throughout history, can be interpreted in 
our model as a dynamic adjustment to a 
steady state with positive accumulation. 
Starting from some weapons stock below 
equilibrium, successive generations in each 
country engage in weapons investment that 
augments the weapons stock. Given one 
country’s investment in weapons, the self- 
interest of the other country dictates that it 
should also engage in investment. Such in- 
vestment is socially counterproductive be- 
low w, in the sense that symmetric changes 


JOHN ET AL.: THE MACROECONOMICS OF DR. STRANGELOVE 51 


in weapons stocks simply decrease the sur- 
vival probability without providing an ad- 
vantage to either country. The consequence 
is that, below w, each generation bequeaths 
a more dangerous world to generations that 
follow. If the steady-state weapons stock 
exceeds w, then the post-w buildup actually 
makes the world safer because of the deter- 
rent effect of possible annihilation. 


IV. Comparative Statics 


Now consider how steady-state saving and 
weapons accumulation are affected by 
changes in the exogenous variables of the 
model. In the following comparative-static 
experiments, we restrict our attention to 
symmetric interior stable equilibria; thus we 
consider the symmetric response of saving 
and weapons accumulation to, for example, 
an equal change in the return on storage in 
both countries. That is, we analyze the ef- 
fects on equilibrium of changes in the char- 
acteristics of the world.'* Many of the re- 
sults are ambiguous, reflecting the facts that 
the effective interest rate, (-)R, is endoge- 
nous even though the return on saving is 
fixed and that the assumptions of our model 
place few restrictions on the second-deriva- 
tive properties of #(-). 


PROPOSITION 1: (i) Increases in the re- 
turn to storage have an ambiguous effect on 
saving and weapons accumulation. (ii) If V(-) 
is logarithmic, then increases in the return to 
storage increase weapons accumulation but 
still have an ambiguous effect on saving. 


PROOF: 
(i) From (11), 


l 


1 
— = AR —1)+ {(su+1)]20. 


‘The comparative-static results for the zero- 
weapons and zero-conflict equilibria are straightfor- 
ward, since the weapons stock does not respond to 
small changes in the parameters. 


| 

V'RIV 

0 WwW 
ilo 
ds 
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(ii) If V(-) is logarithmic, then suv = 1 and 


ds 1 oa 
dR. AR s{(a-—u)|s 
dw 1 


When the return on saving rises, con- 
sumption in old age is more attractive. 
Agents can effectively shift consumption into 
the future in two ways: by increasing future 
consumption directly through saving, or in- 
directly by accumulating weapons. An in- 
crease in the return on saving increases the 
value of consumption in old age, encourag- 
ing agents to accumulate more weapons to 
benefit from this higher consumption. For 
w<w (a<0O), this extra accumulation re- 
duces #(-) and so reduces the effective 
return on saving, thus discouraging saving. 
For w>w, the saving response is ambigu- 
ous. The ambiguity stems partly from the 
familiar income and substitution effects. In 
the special case in which V(-) is logarith- 
mic, these effects cancel. 


PROPOSITION 2: Increases in the depreci- 
ation rate of the weapons stocks have an 
ambiguous effect on saving and weapons ac- 
cumulation. 


PROOF: 
From (11), 


ds w 
u(b—a)|2 


dé 

dw w 

=— —£)-b(v+ 

a(u-£)—b(vt+u)]2 
Increases in 6 decrease the aggregate 


weapons stock (i.e., dW /dé <0), affecting 
both ¢ and @,, and thus a and J, respec- 
tively. For w > w the probability of survival, 
/5,w/5), is increasing in w so that 
a> 0. Additional restrictions are needed to 
sign ds /dé and dw/d6. 
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PROPOSITION 3: Increases in the endow- 
ment of the young have an ambiguous effect 
on saving and increase weapons accumula- 
tion. 


PROOF: 
From (11), 
= = —b)| 20 
de A 
de A 


Increases in the endowment encourage 
weapons accumulation. The effect on saving 
depends upon how the increased accumula- 
tion of weapons affects the implicit interest 
rate o(-)R. If the economy is in the region 
where weapons accumulation increases 
world safety (w>w), then a>0O, so the 
increased accumulation raises the effective 
return on saving and encourages saving. For 
b> 0, however, increased accumulation in- 
creases the marginal return on accumula- 
tion (¢,), making saving a relatively less 
attractive way to increase expected old-age 
utility. 


PROPOSITION 4: Increases in the return 
to winning a conflict, a, have an ambiguous 
effect on both saving and weapons accumula- 
tion. 


PROOF: 
From (11), 
ds Pra 1 
+y)-— 


since @, <0 and ¢,, = 0. 


This result resembles that in Proposition 2, 
reflecting the fact that changes in 6 and 
changes in @ affect agents’ decisions through 
their effect on (-). The resemblence is 
apparent when it is noted that @, and ¢,, 


3 
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play the same role as a and b in Proposi- 
tion 2. 

An increase in a increases the likelihood 
of initiation and hence reduces the survival 
probability (i.e., 6, <0). Such a change can 
be interpreted as a decline in inherent world 
safety. The increased likelihood of annihila- 
tion has the direct effect of reducing saving, 
consistent with Slemrod’s (1986, 1990) find- 
ings that the increased threat of nuclear war 
is accompanied by a decrease in saving. 
Other factors in our model can make the 
overall impact of a change in a be of either 
sign, however. In particular, changes in a@ 
have an ambiguous effect on the marginal 
return to accumulation (¢,,, = 0) and so may 
encourage or discourage saving through this 
channel. 

William Shepherd (1988) argued that the 
United States and the (former) Soviet Union 
were inherently safe countries, in the sense 
that neither had much to gain from initiat- 
ing conflict. He therefore concluded that 
weapons accumulation in these countries 
was inconsistent with the needs of deter- 
rence. In our model, however, the ambigu- 
ous effect of a on , permits high weapons 
accumulation to arise in inherently safe 
economies (when ¢,, <0). Nevertheless, if 
decreases in safety increase the return to 
accumulation, so that > 0, and if u—¢ 
> 0, then we find that weapons accumula- 
tion is lower in safer worlds. 


V. Welfare Analysis 


One reason for adopting an overlapping- 
generations framework is that it permits 
explicit analysis of intergenerational welfare 
effects. In this model there are three dis- 
tinct sources of externality. First, contempo- 
raneous weapons accumulation in country 
A affects the welfare of agents in country B, 
by its effect both on the likelihood of con- 
flict and the expected gain from any conflict 
that does occur. Second, since weapons out- 
last the agents who invest in them, the accu- 
mulation decisions of previous generations 
in country A affect the weapons stock and, 
hence, the utility of the current generation 
in country A. Third, the accumulation deci- 
sions of previous generations of country A 
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affect the utility of the current generation in 
country B. These externalities differ de- 
pending upon the equilibrium weapons 
stock. The presence of externalities leads to 
the conclusion that the Nash equilibrium 
may not be Pareto-efficient. 

The only link between generations in this 
model is through the effect of weapons ac- 
cumulation, so the depreciation rate of the 
weapons stock, 6, provides a measure of the 
extent of the intergenerational externality. 
In particular, notice that if 6=1, the link 
between generations disappears, and there 
is no intergenerational externality; succes- 
sive generations simply play a series of un- 
connected one-shot games. The appropriate 
magnitude of 6 depends on how quickly 
weapons become obsolete and on the length 
of a period. 


A. First Best 


The first-best symmetric outcome in this 
model arises if a social planner can prevent 
the initiation of conflict. At any positive 
symmetric weapons stocks, conflict is a neg- 
ative-sum game by the assumptions on f(-) 
and q(-). If the planner can enforce non- 
initiation, moreover, there is no reason to 
accumulate weapons, so a zero stock of 
weapons is first-best. We henceforth assume 
that noninitiation is never enforceable and 
so restrict attention to the effects of re- 
source reallocations. Not surprisingly, even 
in the absence of enforceable noninitiation, 
the first-best symmetric allocation in this 
model entails zero accumulation of weapons, 
since this implies a survival probability of 
unity. Complete multilateral disarmament 
starting at some date + when agents pos- 
sess positive weapons stocks [i.e., setting 
w(r)= 0 in both countries for all t > 7] need 
not be Pareto-improving, however. These 
results are stated as Proposition 5. 


PROPOSITION 5: (i) Zero weapons accu- 
mulation (w = 0) is a necessary condition for 
a first-best symmetric equilibrium. (ii) If w = 
w(r)>U, or if 6 is sufficiently large, then 
complete disarmament [w(t)=0 W t>7/ is 
Pareto-improving; if w < w(r) and 6 is small, 
complete disarmament will be Pareto- 
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Ficure 7. THE First-Best EQuILIBRIUM 


improving if and only if 
U(e—w— s(w))+ 6(w/d,w/5)V(s(w)R) 
= U(e — w(r)— s(w(7))) 
+ 


where s(-) indicates the optimal choice of 
saving for a given w. 


PROOF: 

(i) Consider first any w’ > w as shown in 
Figure 7. Then by the continuity of #(-) 
there exists a w’<w such that (w”’/6, 
w"/5)= b(w' /5, w'/5). At w”, the prob- 
ability of survival is the same as at w= 
w’, but fewer resources are devoted to wea- 
pons, sO consumption is higher, and all 
agents can be made better off. Now con- 
sider w such that w>w>0O. Then a de- 
crease in w increases the survival probabil- 
ity and increases resources available for 
consumption. By the continuity of 4(-) and 
(0,0) =1, the first-best equilibrium occurs 
at w=0. 

(ii) If the initial weapons stock is below 
w, then weapons depletion increases wel- 
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fare, as noted in (i). If the initial weapons 
stock exceeds w and 6 is small, disarma- 
ment initially decreases world safety by en- 
couraging initiation of conflict and so will 
only be Pareto-improving if the resources 
freed by nonaccumulation outweigh the cost 
of a lower survival probability. In particular, 
if 6 = 0, this must be true at w, which is the 
condition stated in the proposition. If 6 is 
large, however, the economy may “jump 
over” the minimum point w; indeed, if 6 = 1, 
multilateral disarmament immediately 
moves the economy to its first-best equilib- 
rium. 


The first-best equilibrium in this model is 
identical to that in van der Ploeg and de 
Zeeuw’s (1990) model. The overlapping- 
generations framework, however, compli- 
cates the welfare analysis of disarmament 
because different generations are affected 
differently. Proposition 5 illustrates one im- 
portant conclusion of this paper: if prior 
accumulation decisions have placed the 
economy above w, then intergenerational 
externalities complicate any attempts at 
Pareto-improving disarmament (presuming, 
of course, that weapons cannot be costlessly 
destroyed). 

As a straightforward corollary of Proposi- 
tion 5, we note that if the economy exhibits 
multiple symmetric equilibria in the range 
[0,w], then these equilibria are inversely 
Pareto-ranked by the stock of weapons. 
Further, if the corner solution w =0 is an 
equilibrium, then it is evidently Pareto- 
superior to any other equilibrium. Hence- 
forth, we abstract from the possibility of 
multiple equilibria. 


B. Pareto Improvements with 
Intergenerational Transfers 


Proposition 5 shows that disarmament 
need not be Pareto-improving, since it may 
be costly to early generations. Pareto im- 
provements are generically possible if we 
permit intergenerational transfers, but such 
improvements may involve additional accu- 
mulation of weapons rather than disarma- 
ment. First, note that we can write steady- 
state utility as a function of steady-state w 


| | 
| 
| | | 
| | 
0 w 
w w’ W 
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as follows: 
T(w) =U(e-—w-—s(w)) 
+ 


where s(w) indicates optimal saving. We 
know from Proposition 5 that this has a 
global maximum at w=0. Differentiating 
and appealing to the envelope theorem, we 
obtain 


T’(w) = —U"(-) + 
=(V(-)/¢) 

+ 
using the first-order condition 
dRV'(-). 

If the economy is initially in noncoopera- 


tive equilibrium, then 6, /@=V'()R/V, 
implying 


U"(-)= 


T’(w) =V(- )[(b, — 


) 


The first term is the intergenerational exter- 
nality arising from higher inherited weapons 
stocks from both countries, while the sec- 
ond term is the within-generation across- 
country externality. The intergenerational 
externality is negative (i.e., higher weapons 
stocks reduce the welfare of future genera- 
tions) for w < w, and it is positive for w > w. 
The within-generation externality is nega- 
tive for w<w, and positive for w> wy. 
Since w, > w, it follows that both externali- 
ties are negative if the economy is in a 
noncooperative equilibrium below w. Higher 
steady-state weapons stocks in this region 
are thus associated with lower welfare. Con- 
versely, both externalities are positive if the 
equilibrium entails a weapons stock in ex- 
cess of w,, in which case higher equilibrium 
weapons stocks are associated with higher 
welfare. Propositions 7 and 8 in Section VI 
show that, in both these cases, Pareto im- 
provements are possible without intergener- 
ational transfers. 
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In the intermediate region, the intragen- 
erational externality is negative, but the in- 
tergenerational externality is positive. If 
T’(w)> 0 at a noncooperative equilibrium, 
then the intergenerational externality domi- 
nates, and higher steady-state weapons 
stocks are associated with higher utility, 
suggesting that buildup of weapons would 
be Pareto-improving. Such a conclusion is 
not quite right, however, because weapons 
buildup imposes costs on current genera- 
tions but provides benefits to future genera- 
tions. As established in Proposition 6, 
Pareto-improving buildup is possible if 


(1-5) + 6,R=(6,+ 8) /5] 


+ ,[1+(R-1)/65]>0. 


This expression differs from T’(w) by the 
addition of ¢,(R—-1)/6 <0. A high return 
to storage or a low depreciation rate thus 
means disarmament may be Pareto-improv- 
ing even if T’(w)>0. 


PROPOSITION 6: Assume that the econ- 
omy is dynamically efficient (R> 1) and in a 
noncooperative equilibrium (wN©) with 6, + 
and 6,<0 (i.e, <w,).8 
Pareto improvements are generically possible. 


PROOF: 

We first show that if ¢,1—6)+ 6,R>0, 
then a small increase in weapons is Pareto- 
improving with appropriate transfers. Let 
w(t)= wo Vt #7; w(r) + €, € small. 
Then, starting at date 7 + 1, construct trans- 
fers T(t) from the young of generation ¢ to 
the old of generation t —1 (i.e., a transfer 
from young to old at time t) such that the 
expected utility of generation ¢t—1 is un- 
changed. Recall that 


&d(W(t +1),W(t +1)) 


= 


‘These three assumptions rule out the obvious 
sources of Pareto improvements. See Propositions 7 
and 8 for more discussion. 


| 


Take a Taylor-series expansion around w = 
wNC and T(z + 1)=0 of the expected utility 
of generation 7. Using the first-order condi- 
tions, the change in expected utility is then 


b2(-)V(-)e+ o(-)V"(-)T(7 +1). 


Setting 7(7 +1) so that expected utility is 
unchanged implies 


T(7 +1) =— > 0. 
Generation 7 is made worse off by the extra 
weapons accumulation of the other country 
(¢, <0) and so must be compensated when 
old by the next generation of young agents. 

Similarly, T(7 +2) can be defined so that 


the expected utility of generation 7+1 is 
unchanged: 


+ $,)(1— +1) 
+ +2) =0 
= (6,+ Je 
— @RV'(-)T(7 +1) 
+ pV'(-)T(7 +2) =0 
= [($, + 62)(1- 6) + Je 
+ dV'(-)T(7 +2) =0. 
Likewise, T(r +3) is defined by 
+ Je U'(- +2) 
+ V'(-)T(7 +3) =0 
= (6, + $2)(1- Je 
— dRV'(-)T(7 +2) 
+ bV'(-)T(7 +3) =0 


= {(6, + 6)? + (1-8) R] + De 


+ oV"(-)T(7 +3) =0. 
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By repeated substitution, 7(7+ N) is de- 
fined by 


N 
> (1—5)"RN-" + 


n=1 


xXV(-)e+ oV'(-)T(7+N)=0 


N 
+43} 
n=1 


x RNV(-)e t+ bV'(-)T(7 + N)=0. 


As N >», and since (1— 6)/R <1, the term 
in braces tends to 


($, + 
R-(1-58) 


which is positive since ¢,(1—6)+ 6,R>0 
by assumption. It follows that, at some date 
7*, the transfers change sign; that is, mem- 
bers of generation r* — 1 make both a posi- 
tive transfer when young to the old of gen- 
eration 7* —2 and a positive transfer when 
old to the young of generation 7*. 

Now set T7(t)=0V t >7*. The welfare of 
all generations up to and including genera- 
tion r* —2 is unchanged by construction. 
Generation r* —1 is better off since it does 
not have to make the positive transfer at r* 
that would leave its utility unchanged. All 
future generations are also better off be- 
cause they neither make nor receive trans- 
fers but do inherit a larger weapons 
stock. This pattern of transfers is thus 
Pareto-improving. A similar argument holds 
if d,1—6)+4,R <0; in this case a small 
decrease in w(7) is Pareto-improving. 


Proposition 6 establishes that there is 
some critical value of the weapons stock 
above which Pareto improvements are (lo- 
cally) possible by building up weapons and 
below which Pareto improvements are (lo- 
cally) possible by reduction of weapons 
stocks. This critical value is higher, the 
higher the return on storage and the depre- 
ciation rate. 


| 
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VI. Cooperative Equilibria 
A. Cooperation Across Countries 


Consider the possibility that members of 
a given generation in countries A and B 
cooperate. As in the earlier discussion, we 
disregard the trivial case of a nonaggression 
pact whereby agents agree not to initiate 
conflict (note that such an agreement in 
general will not be time-consistent) and re- 
strict our attention to the cooperative choice 
of weapons. We think of such cooperation 
as a verifiable arms treaty that neither re- 
stricts future generations nor precludes con- 
flict in the current period. We assume, in 
keeping with our symmetry assumptions, 
that both countries have equal bargaining 
power. The problem faced by the two coun- 
tries is to choose both their weapons stocks 
to maximize (3) subject to (4) and (5), taking 
as given the inherited weapons stocks in 
both countries. The first-order conditions 
for an interior equilibrium in this problem 
are 


(20) 
(21) 


These first-order conditions are identical! 
to those from the original problem [equa- 
tions (7) and (8)], except that there is an 
additional ¢,V(-) term in equation (21). 
This indicates that the two countries inter- 
nalize the intragenerational externality. The 
intergenerational externality remains, how- 
ever. We consider this case to be of particu- 
lar interest since intragenerational external- 
ities are intrinsically easier to internalize 
than external effects across generations. 

We now consider the possibility of a 
regime change at time +. That is, we sup- 
pose that the economy is at a noncoopera- 
tive steady-state equilibrium (w°) prior to 
7, and that at all dates including and after 
7, both countries cooperate on their choice 
of weapons. Such a regime change could be 
interpreted as corresponding to the end of 
the Cold War. The following propositions 
reveal that both buildup and reduction of 
weapons stocks are possible responses and 


—U'(-) + 6(-)V'(-)R=0 


—U"(-) + (6, + o,)V(-) = 0. 
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that disarmament need not be Pareto- 
improving. '* 


PROPOSITION 7: If ¢,>0 at wX© so 
that W>w'°>w,, then agents in both 
countries will choose cooperatively to build up 
their stocks of weapons, either to an interior 
equilibrium, w©, or to W. Such buildup is 
Pareto-improving. 


PROOF: 

First consider accumulation to an interior 
cooperative equilibrium, w°, as illustrated 
in Figure 8. Let w(t)= 6W(t). From equa- 
tion (2), 


w(t+1)=(1-—6)w(t)+ dw(t). 


If, at time t, <w(t)<w°*, then the 
following inequality holds: 


b(w(t)/5) 
V'(s(W(t))R)R 
V(s(w(t))R) 


At date tf, agents choose their weapons stock, 
w(t), to equate ,)/¢@ and V’R/V, 
where V(-) and #(-) are functions of w(t) 
and the inherited weapons stock, w(t), which 
they take as given. If 5<1, then (¢,+ 
,)/ responds less to changes in w(t) 
than to changes in steady-state w, implying 
that the out-of-steady-state (6,+,)/¢ 
passes through w(t) and is flatter than the 
steady-state relationship. (In the limit as 
5 — 0, this curve becomes horizontal.) Also, 
if 5<1, changes in w(t) lead to a smaller 
increase in #(-) than do changes in steady- 
state w and so have a smaller effect on 


“The analysis of Proposition 7 uses our assumption 
that (6,+ ,5)/@ is upward-sloping. If this relation- 
ship slopes downward, but V’R/V (for an individual 
generation) still slopes upward, then the analysis is 
almost identical. In this case individual generations 
might choose weapons investment in excess of w©, the 
cooperative equilibrium, but the aggregate weapons 
stock still will not exceed w©. If both relationships 
slope downward, then we cannot rule out the possibil- 
ity of oscillatory adjustment of the weapons stock. 


| 
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FicureE 8. BurLpup To AN INTERIOR 
COOPERATIVE EQUILIBRIUM 


saving through this channel. As a conse- 
quence, the out-of-steady-state V’R/V ro- 
tates through w(t) and becomes steeper than 
its steady-state counterpart. By continuity of 
V(-) and @(-), it follows that 


w(t) <w(t) 
= W(t) <w(t+1) 


If 6=1, then there is no intergenerational 
externality, and the out-of-steady-state and 
steady-state relationships are coincident. 
Thus, 


w(t) =w(t+1) =w°. 


At the time of the regime change w(r) = 
wNC. Therefore if 5 <1, the two countries 
cooperatively build up their weapons stocks, 
which asymptotically approach w©. If 5 = 1, 
the two countries move immediately to w°. 

Buildup to W is illustrated in Figure 9. 
The analysis is similar, except that individ- 
ual generations may choose investment in 
excess of w, and so W is obtained in finite 
time even if 6<1. Buildup is Pareto- 
improving since ¢, + > 0 for <w < 


w©: a larger world weapons stock bestows a 
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$1 + 


FiGurE 9. ButLpup To A No-ConrFLIcT 
EQUILIBRIUM 


positive externality on future generations. 
Generations after 7 are better off since they 
could always choose the same allocation of 
resources as at the noncooperative equilib- 
rium and benefit from the larger inherited 
weapons stock. 

If the economy is originally in equilibrium 
above w,, then cooperation entails internal- 
ization of a positive within-generation exter- 
nality, so that agents choose to build up 
their weapons stocks. When the within- 
generation externality (¢,) is positive, then 
so too is the across-generation externality 
+ 6, — 5)/5]. The two countries thus 
cooperatively agree to forgo more consump- 
tion to make conflict less likely, and despite 
the lower consumption, cooperation is 
Pareto-improving. This result is reminiscent 
of Garfinkel’s (1990) finding that adversarial 
countries in a stochastic environment might 
accumulate weapons in a cooperative equi- 
librium. Her result obtains because, in gen- 
eral, there do not exist punishments suffi- 
ciently harsh to make not deviating from the 
cooperative agreement incentive-compatible 
in the absence of accumulation. Similarly, 
cooperation that leads to disarmament need 
not be Pareto-improving, as established in 
the following proposition. 
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FiGureE 10. REDUCTION OF WEAPONS STOCKS TO 
AN INTERIOR EQUILIBRIUM 


PROPOSITION 8: (i) If 6, <0 at wN© and 
w <wN <w,, then agents in the two coun- 
tries will choose cooperatively to reduce 
weapons stocks to an interior equilibrium or 


to w =0. This reduction need not be Pareto- 
improving. (ii) If 6, <0 at wN© and wN© < 
w, agents in the two countries will coopera- 
tively cease all weapons investment and re- 
duce weapons stocks to w = 0. This reduction 
is Pareto-improving. 


PROOF: | 

(i) Arms reduction to an interior equilib- 
rium is represented in Figure 10; the analy- 
sis is analogous to that in Proposition 7. 
Since ¢,<0 at w%, agents internalize a 
negative intragenerational externality and so 
cooperatively choose to reduce their 
weapons stocks. Since > w, 6, + > 0, 
and the intergenerational externality is 
therefore positive. Disarmament in general 
imposes costs on subsequent generations by 
decreasing the survival probability and so 
need not be Pareto-improving. Similar anal- 
ysis applies for reduction of weapons stocks 
to zero; in this case, arms reduction is ini- 
tially costly to future generations but even- 
tually benefits them. 

(ii) Reduction of weapons stocks to zero 
is represented in Figure 11. Since V’R/V 
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FiGure 11. REDUCTION OF WEAPONS STOCKS TO 
\N EQUILIBRIUM WITHOUT WEAPONS 
ACCUMULATION 


is everywhere positive and (¢,+¢,)/¢@ is 
negative, generation rt and all subsequent 
generations immediately move to the corner 
solution where w=0. Over time, natural 
depreciation of weapons allows the aggre- 
gate weapons stock to decline asymptoti- 
cally to zero. Below w, decreases in the 
aggregate weapons stock increase the sur- 
vival probability and so impose positive ex- 
ternalities on subsequent generations. Dis- 
armament is therefore Pareto-improving. 


Any individual generation does not take 
account of the effects of its actions on the 
safety of future generations. Internalization 
of the within-generation externality thus 
need not be Pareto-improving for standard 
second-best reasons: it worsens the across- 
generation externality. Note, though, that 
part (i) of Proposition 8 does not imply that 
disarmament is never Pareto-improving even 
if the noncooperative weapon stock exceeds 
w. First, if 6 =1, then there is no intergen- 
erational externality. Second, even if a gen- 
eration after t is made worse off because it 
inherits a lower weapons stock, it also bene- 
fits from the fact that it in turn is able to 
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internalize the intragenerational externality 
through international cooperation: a coop- 
erative choice of a smaller weapons stock 
frees resources for consumption. 

Our results are consistent with the model 
of Intriligator and Brito (1984), which 
demonstrates that a reduction of weapons 
stocks may promote war rather than peace.'° 
The buildup scenario is also consistent with 
Intriligator and Brito’s (1984) finding that 
buildup may increase world safety, and in 
our model, it may increase world welfare as 
well. 


B. Cooperation Across Generations 


Suppose that social planners in each 
country choose the resource allocation for 
their individual countries for all time, taking 
the rules of international interaction as 
given. This imposes within-country intergen- 
erational cooperation while sustaining the 
potential for international conflict. In the 
setting of our model, this is perhaps less 
plausible in that it requires internalization 
of externalities between agents who are not 
alive at the same time. It does however 
allow for comparison with the infinitely- 
lived-agent models in the arms-control liter- 
ature.!° We analyze this problem as a one- 
shot game in which a social planner in each 
country treats all generations symmetrically 
and chooses a constant allocation of re- 
sources through time. In this scenario, the 
planners can ensure that their countries are 
dynamically efficient and will impose a larger 
weapons stock on the world, as shown in 
Proposition 9. 


'SRecall that in our model, lower weapons stocks 
are always associated with a greater likelihood of con- 
flict, but a smaller probability of disaster should con- 
flict occur. This is because the probability of annihila- 
tion, q(-), is increasing in w, but the probability that 
neither side intitiates, z(q), is increasing in q(-). The 
trade-off between these two effects gives rise to the 
behavior of the survival probability, @(-), as w varies. 
In comparing our results to those of Intriligator and 
Brito (1984), we are interpreting “war” in our model as 
annihilation, not conflict. 

'©Intergenerational cooperation also might be a re- 
sult of altruism across generations, as in the literature 
on Ricardian equivalence. 


MARCH 1993 


PROPOSITION 9: Jf the intergenerational 
within-country externality is internalized, then 
goods will be stored only if storage is dynami- 
cally efficient. If the economy is dynamically 
efficient, the equilibrium weapons stock will 
be higher than in the noncooperative equilib- 
rium. 


PROOF: 
The planner for country A chooses w% 
and c° to maximize 
U(c’) + 


subject to 


if R<1 
it 


—y 
c°=R(e—'c"— w) 


The first-order conditions yield 


(22) U(-)= oV'(-)R_ if R>1 
and 

(23) U'(-)=$V(-)/6. 


Equation (22) establishes that storage is only 
undertaken if it is dynamically efficient, that 
is, if R>1.'’ If R<1, the planner achieves 
the optimal allocation of goods through 
transfers from young to old. The only dif- 
ference between equations (23) and (8) is 
the 1/6 in (23): @, /6 represents the total 
effect on country A of present weapons 
accumulation. As is clear from Figure 12, 
the equilibrium weapons stock given inter- 
generational cooperation, w'©, exceeds the 


noncooperative weapons stock, wN©.'® 


If the planner discounts the welfare of future 
generations, or if there is population growth, then this 
condition is amended in the usual way. 

'SIf, as can occur, V'R/V is downward-sloping, 
then it might seem that intergenerational cooperation 
would reduce weapons accumulation. In this case, how- 
ever, the equilibria being compared are neither stable 
nor unique. 
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Ficure 12. AN WITH 
INTERGENERATIONAL COOPERATION 


Since old agents’ weapons accumulation 
generates uncompensated benefits for young 
agents in their own country, it is possible 
that a social contract of the type proposed 
by Laurence J. Kotlikoff et al. (1988) could 
be developed to induce each generation to 
accumulate more weapons. Notice that, 
while this is welfare-improving from the 
perspective of a single country, it may actu- 
ally lower world welfare. Again, this is a 
second-best result. 


VII. Conclusion 


In this paper we have developed a two- 
country overlapping-generations model of 
deterrence or conflict initiation. Our model 
emphasizes the intergenerational element 
of international tensions and so highlights 
aspects of arms accumulation that have re- 
ceived little attention in the literature. We 
abstract, however, from some topics that 
have received more attention in the litera- 
ture. In particular, successive generations in 
our model piay one-shot games, removing 
the scope for repeated-game phenomena, 
such as reputation effects and punishment 
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strategies. We suspect that real-world be- 
havior is characterized by repeated games 
with intergenerational externalities. 

This paper follows in the tradition of the 
literature in that we examine the arms race 
in a two-country world. While such analysis 
was evidently appropriate during the Cold 
War era, it is less so today. Our approach 
remains useful for a number of reasons. 
First, the world may once again be domi- 
nated by two superpowers in the future. 
Second, and perhaps more importantly, our 
analysis also can be applied to regional con- 
flicts when devastation of both countries is a 
possible outcome. Moreover, our model 
lends itself to a multicountry generalization 
with coalition formation; such an approach 
may provide a useful framework for analyz- 
ing present-day conflicts. Finally, a lesson of 
our model is that, although the world may 
no longer be dominated by two superpow- 
ers, the legacy of the Cold War affects cur- 
rent decisions and current welfare. 

Despite our emphasis on intergenera- 
tional effects, some of our results resemble 
those of the arms-control literature with 
infinitely-lived agents. We find that arms- 
control (reduction) arrangements can either 
increase or decrease the possibility of inter- 
national conflict and that international ar- 
rangements leading to increases in interna- 
tional weapons stocks are successful both in 
deterring conflict and in improving world 
welfare. While we do find that having no 
weapons is always better than having some 
weapons, fewer weapons need not be bet- 
ter, even if the reduction in weapons is 
brought about by an international agree- 
ment. Although apparently counterintuitive, 
this result is firmly grounded in intergenera- 
tional trade-offs: the immediate welfare im- 
provements won by reducing the resources 
devoted to defense may be accompanied by 
uncompensated welfare losses on future 
generations. While the end of the Cold War 
may bring a peace dividend, it does not 
guarantee a safer world: current genera- 
tions still bear the consequences of past 
accumulation decisions. This we see as a 
cautionary tale for the proponents of the 
“new world order.” 


V'RIV 
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A Search-Theoretic Approach to Monetary Economics 


By Nosunwtro KryotaKi AND RANDALL WRIGHT* 


The essential function of money is its role as a medium of exchange. We 
formalize this idea using a search-theoretic equilibrium model of the exchange 
process that captures the “double coincidence of wants problem” with pure 
barter. One advantage of the framework described here is that it is very 
tractable. We also show that the model can be used to address some substantive 
issues in monetary economics, including the potential welfare-enhancing role of 
money, the interaction between specialization and monetary exchange, and the 
possibility of equilibria with multiple fiat currencies. (JEL E00, D83) 


Since the earliest writings of the classical 
economists it has been understood that the 
essential function of money is its role as a 
medium of exchange. The use of monetary 
exchange helps to overcome the difficulty 
associated with pure barter in economies 
where trade is not centralized through some 
perfect and frictionless market. Many at- 
tempts have been made in the literature to 
formalize this, with varying degrees of suc- 
cess.! In this paper, we present a search- 
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The views expressed here are those of the authors and 
not necessarily those of the Federal Reserve Bank of 
Minneapolis or the Federal Reserve System. 

'There is a voluminous literature on the founda- 
tions of monetary theory, and rather than attempting 


theoretic equilibrium model of the exchange 
process that seems to capture the “double 
coincidence of wants problem” with pure 
barter in a simple and natural way. We 
show that this gives rise to a medium-of- 
exchange role for fiat currency. We also 
show that the model can be used to address 
some substantive issues in monetary eco- 
nomics. 

Previously, in Kiyotaki and Wright (1989), 
we used a search-theoretic model to deter- 
mine endogenously which commodities 
would become media of exchange, or com- 
modity money. We were also able to con- 
struct an equilibrium with valued fiat cur- 
rency; but since that model was designed 
primarily to study commodity money, it is 
not the most tractable framework within 
which to discuss fiat money. For instance, 
we only showed that fiat money could be 
valued if it has intrinsic properties at least 
as good as the best available commodity 


to cite all of the relevant work, we refer the reader to 
the survey by Joseph M. Ostroy and Ross M. Starr 
(1990). We would, however, like to mention the contri- 
bution of Robert A. Jones (1976) and the extensions by 
Seongwhan Oh (1989) and Katsuhito Iwai (1988). Al- 
though there are many technical differences, that model 
is definitely related in spirit to the search-theoretic 
approach we describe here. In particular, there are 
heterogeneous agents and commodities, and in equilib- 
rium certain commodities are chosen as media of ex- 
change in order to reduce search costs. 
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money. In Kiyotaki and Wright (1991), we 
used an alternative search-based model to 
illustrate the robustness of monetary equi- 
libria; that is, fiat money can be valued as a 
medium of exchange even if it has intrinsic 
properties, like its rate of return, that are 
inferior to other available assets. We also 
constructed an example in that model to 
show how the use of fiat money can affect 
welfare. 

However, due to the generality of the 
specification in that paper, we were not able 
to say much about the features of monetary 
equilibria, other than that they exist and are 
robust, and our characterization of welfare 
did not proceed much beyond a numerical 
example. The model to be presented in this 
paper can be thought of as a simplified 
version of Kiyotaki and Wright (1991). Our 
first objective is to demonstrate that this 
class of models is actually very tractable. 
Our second objective is to convince the 
reader that search-based models can be used 
not just to determine which objects serve as 
media of exchange or to prove the existence 
of valued fiat money, but to address some 
more applied issues in monetary economics 
as well. In particular, we use the model to 
discuss the potential welfare-enhancing role 
of money, the interaction between special- 
ization and monetary exchange, and the 
possibility of equilibria with multiple cur- 
rencies. 

The rest of the paper is organized as 
follows. In Section I we describe the basic 
model. In Section II we characterize the 
welfare effects of money. Among other 
things, the model implies that equilibria 
where fiat money is universally acceptable 
are generally superior to nonmonetary equi- 
libria and to equilibria where it is only 
partially acceptable. In Section III we intro- 
duce specialization by producers, by assum- 
ing that they face a trade-off between 
productivity and the marketability of their 
output. The model implies that use of 
money, by making exchange easier, leads to 
more specialized and, therefore, more effi- 
cient production. In Section IV we discuss a 
version of the model that allows for multi- 
ple fiat currencies. In Section V we con- 
clude. In order to improve the presentation 


MARCH 1993 


we make some simplifying assumptions in 
specifying the basic model that one arguably 
may want to relax, and we show how to do 
so in the Appendix. 


I. The Basic Model 


The economy is populated by a large 
number of infinite-lived agents, with total 
population normalized to unity. There is 
also a large number of consumption goods. 
These consumption goods are indivisible and 
come in units of size one. We refer to them 
as real commodities, to distinguish them 
from fiat money, which is an object that no 
one ever consumes and can be thought of as 
a collection of pieces of paper or certain 
types of seashells, for example, with no in- 
trinsic value. A crucial feature of the model 
is that there is an exogenous parameter x, 
with 0< x <1, that captures the extent to 
which real commodities and tastes are dif- 
ferentiated. In particular, x equals the pro- 
portion of commodities that can be con- 
sumed by any given agent, and x also equals 
the proportion of agents that can consume 
any given commodity.* If a commodity is 
one of those that can be consumed by an 
agent, then we say that it is one of his 
consumption goods. Consuming one of his 
consumption goods yields utility U > 0, while 
consuming other commodities (or money) 
yields zero utility. 

Initially, a fraction M of the agents are 
each endowed with money while 1— M are 
each endowed with one real commodity, 
where 0 < M <1. Money may or may not 
have value. If it does, then it is convenient 
to assume that agents who are initially en- 
dowed with money are endowed with ex- 
actly one unit of real balances, so that in 
order to buy a real commodity they must 
spend all of their cash. There are two ways 
to guarantee that this is the case. First, and 


*For example, suppose there are K distinct goods 
and each agent consumes k of them; then x =k /K. 
Alternatively, suppose there is a continuum of goods 
indexed by points around a circle of circumference 1 
and each agent consumes goods corresponding to points 
in a fixed arc; then, x is the length of that arc. 
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most straightforwardly, we can simply as- 
sume that the monetary object is indivisible, 
like the real commodities in the model. Then 
if money trades at all it must trade one-for- 
one against a real commodity, and each 
agent with one indivisible unit of money will 
have one unit of real balances. Alterna- 
tively, we can assume that money is divisi- 
ble, determine the price level endogenously 
for a given stock of nominal currency, and 
endow some agents at the initial date with 
exactly enough nominal currency to consti- 
tute a single unit of real balances. We begin 
with the former approach of assuming that 
money is indivisible and take up the latter, 
slightly more complicated, approach later. 

Money and commodities are costlessly 
storable. Money cannot be produced by any 
private agent, while real commodities can 
be produced according to the following 
technology. One unit of output requires two 
inputs: a consumption good and a random 
amount of time. That is, once an agent 
consumes he enters a production process 
that yields one unit of one real commodity, 
drawn randomly from the set of all com- 
modities, according to a continuous-time 
Poisson process with arrival rate a>0. 
Thus, a measures productivity in the sense 
of average output per unit time. Note that 
agents who have not consumed cannot pro- 
duce. Furthermore, as is standard in the 
equilibrium search literature, we assume 
that agents cannot consume their own out- 
put (see e.g., Peter A. Diamond, 1982, 1984; 
Kiyotaki and Wright, 1991). This assump- 
tion helps to simplify the presentation and 
to facilitate comparison with earlier models, 
but as we show in Appendix A it is other- 
wise completely unnecessary. 

An agent who has just produced enters 
an exchange sector where he looks for other 
agents with whom to trade. Traders in the 
exchange sector meet pairwise and at ran- 
dom according to a Poisson process with 
constant arrival rate B>0.° When two 


The assumption that the arrival rate B is constant 
(and independent of the number of traders) is equiva- 
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traders meet, exchange takes place if and 
only if it is mutually agreeable, that is, if 
and only if both agents are at least as well 
off after the trade. Because there is a large 
number of anonymous agents, all trade is 
quid pro quo (there can be no IOU’s or 
other forms of private credit). We also as- 
sume that there is a transaction cost e€ in 
terms of disutility, where 0<e<U, that 
must be paid by the receiver whenever any 
real commodity is accepted in trade. This 
transaction cost implies that a trader who is 
indifferent between holding two real com- 
modities will never trade one for the other. 
For simplicity, we assume for most of the 
presentation that the transaction cost of ac- 
cepting fiat money is zero; this simplifies the 
presentation considerably but, as we show 
in Appendix B, it is possible (and interest- 
ing) to relax this assumption. 

Since exchange takes place if and only if 
mutually agreeable, an agent with either 
one unit of real balances or one real com- 
modity cannot acquire additional money or 
another commodity except by giving up his 
entire inventory. Furthermore, no agent in 
the exchange sector can produce anything 
until he trades for one of his consumption 
goods and consumes, given the specified 
technology. These observations have the fol- 
lowing implication: if each trader starts at 
the initial date with either one unit of real 
balances or one real commodity, then in 
equilibrium all traders will always have ei- 
ther one unit of real balances or one real 
commodity.’ Agents with real commodities 


lent to the assumption of a constant-returns-to-scale 
(CRS) meeting technology. That is, a CRS meeting 
technology implies that the total number of meetings 
per unit time is proportional to the number of traders, 
and so the arrival rate for a representative trader 
(which is just the number of meetings divided by the 
number of traders) is a fixed constant. We ignore 
degenerate outcomes in which there are no agents in 
the exchange sector, and hence, the arrival rate for an 
individual should he enter this sector would be 0. 
‘Note that there are no physical restrictions in the 
model against storing more than one commodity, stor- 
ing arbitrary quantities of money, or storing money and 
commodities simultaneously. Rather, these results are 
due to the assumption that consumption is a necessary 
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are referred to as commodity traders, while 
agents with fiat money are referred to as 
money traders. Let yw denote the fraction of 
traders who are money traders, so that a 
trader located at random has money with 
probability ~ and a real commodity with 
probability 1— yw. 

Individuals choose strategies for deciding 
when to accept various commodities and fiat 
money in order to maximize their expected 
discounted utility from consumption net of 
transaction costs, taking as given the strate- 
gies of others. We look for Nash equilibria. 
We restrict attention for the most part to 
symmetric equilibria, where all agents and 
all real commodities are treated the same, 
and to steady-state equilibria, where strate- 
gies and all aggregate variables are constant 
over time. To construct the set of such 
equilibria, we describe some basic proper- 
ties that they must satisfy, use these proper- 
ties to describe an individual trader’s best- 
response correspondence, and determine its 
fixed points. 

The first thing to note is that an agent 
always accepts a real commodity if it is one 
of his consumption goods, whereupon he 
immediately consumes it and enters the 
production process. Also, we claim that a 
commodity trader will never accept a com- 
modity that is not one of his consumption 
goods. This is due to the fact that in a 
symmetric equilibrium no real commodities 
are treated as special, and therefore, the 
probability of a trade offer being accepted 
by the next agent one meets is independent 
of the type of commodity one has. Hence, 
there is no advantage to trading one real 
commodity for another, and since there is a 
transaction cost ¢, unless a commodity is 
going to be consumed it will never be ac- 


input into production (an assumption we adopted from 
S. Rao Aiyagari and Neil Wallace [1991, 1992]) and the 
way we distribute the initial endowments. In principle, 
we could initially endow agents with more than one 
unit of commodities or real balances, but this would 
require solving for the steady-state inventory distribu- 
tion and would lead to potentially complicated bargain- 
ing problems in bilateral exchange. 
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cepted. This means that x is the probability 
that a commodity trader located at random 
is willing to accept any given commodity, 
and therefore x” is the probability that two 
commodity traders consummate a barter 
transaction. This is precisely William Stan- 
ley Jevons’s (1875) “double coincidence of 
wants problem” with direct barter: not only 
do you have to meet someone with some- 
thing that you want, this someone also has 
to want what you have.° 

The next thing to determine is whether 
individuals accept money. Let II denote the 
probability that a random commodity trader 
accepts money and let 7 be the best re- 
sponse of a representative individual. We 
will solve the best-response problem using 
dynamic programming. Let V; denote the 
payoff or value function for the individual in 
state j, where ;=0, 1, or m indicates that 
he is a producer, a commodity trader, or a 
money trader, respectively. Then, if r > 0 is 
the rate of time preference, Bellman’s equa- 
tions are given by 


(1) My =a(V,—-Vo) 


(2) rV, = B(1— w)x*(U-e + 


+ Buxmax7(V,, —V,) 


(3) = — e+. ¥,—¥,). 


5The result that traders never accept commodities 
that are not their consumption goods means that there 
is nO commodity money in a symmetric equilibrium. 
This is not to say that the model cannot have nonsym- 
metric equilibria, in which some real commodities do 
become media of exchange, but only that we restrict 
attention to symmetric outcomes here. Commodity 
money is analyzed in a related model in Kiyotaki and 
Wright (1989). Although a small transaction cost ¢ 
guarantees that there will be a double-coincidence 
problem in a symmetric equilibrium, the double-coinci- 
dence problem arises without transaction costs in the 
asymmetric equilibria studied in Kiyotaki and Wright 
(1989) and Aiyagari and Wallace (1991). Another way 
to guarantee that there is a double-coincidence prob- 
lem is to assume that a real commodity can only be 
stored by its producer, as in Kiminori Matsuyama et al. 
(1993), which seems natural if we interpret these com- 
modities as services rather than goods. Under this 
assumption, we can dispense with the transaction cost 
entirely. 


3 
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Equations like these are standard in 
search theory (formal derivations for a 
closely related model can be found in 
Kiyotaki and Wright [1991], for example). 
They have the following interpretation. Ac- 
cording to (1), the flow return to a pro- 
ducer, rV,, equals the rate at which output 
is produced, a, times the gain from switch- 
ing from production to exchange, V, — Vp. 
According to (2), the flow return to a com- 
modity trader equals the sum of two terms. 
The first term is the rate at which he meets 
other commodity traders, B(1— pw), times 
the probability that both want to trade, x’, 
times the gain from trading, consuming, and 
switching back to production, U «+V, — 
V,. The second term is the rate at which he 
meets money traders, Bu, times the proba- 
bility that a money trader wants to trade, x, 
times the gain from accepting money with 
probability 77, where 7 is chosen optimally. 
According to (3), the flow return to a money 
trader equals the rate at which he meets 
commodity traders, times the 
probability that both want to trade, IIx, 
times the gain from trading, consuming, and 
switching to production, U — + V, — V,,.° 

The above dynamic program depends not 
only on the strategies of others, as repre- 
sented by II, but also on py, the proportion 
of traders holding money. However, u can 
be determined as a function of I] and the 
initial endowment of money, M. Begin by 
letting No, N,, and N,, denote the propor- 
tions of the population who are producers, 
commodity traders, and money traders. 
Then the model has a dynamic structure 
with transitions as illustrated in Figure 1. 
To determine its steady state, we equate the 
flow out of and into production: 


(4) aN 


If we use the fact that the N;’s sum to 1 and 


°We have implicitly assumed that a money trader 
never accepts a commodity that is not one of his 
consumption goods, but one can show that this is 
always true in equilibrium. That is, one can show that 
the only time an agent would want to exchange money 
for a commodity that is not one of his consumption 
goods is when money is valueless, in which case he 
cannot. 
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Ficure 1. DyNAMIC STRUCTURE OF THE MODEL 


the fact that N,, = M (the number of money 
traders equals the number of agents en- 
dowed with money), (4) can be reduced to 


(5) M=ap/(ato) 
where = II) is defined by’ 


(6) 


Equation (5) is a quadratic in pw, and for 
any Me€[0,1] and I1e€[0,1] there will 
exist a unique value of uw = w(M, II) in [0,1] 
satisfying this equation. Furthermore, one 
can show that =1, 
du /OM>0, and dp /dll>0. Given p= 
u(M,I1), the unique steady state is fully 
described by 


(7) =¢/(a+¢) 


N,=(1-p)a/(a+¢). 


However, for the purpose of analyzing the 
above dynamic program, # summarizes all 
the agent needs to know about the steady 
state. If we insert = II) into (1)-(3), 


Note that ¢ can be interpreted as consumption per 
trader per unit time: it is the rate at which a represen- 
tative trader meets commodity traders, B(1 — pw), times 
the probability that a deal is consummated, which is 
the probability our representative trader has money 
and they trade, »xIl, plus the probability that our 
representative trader has a real commodity and they 
trade, (1— 
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FiGurE 2. THE BeEst-RESPONSE CORRESPONDENCE 


then, given M, this dynamic program de- 
fines a correspondence from II to best re- 
sponses, 7r. The set of equilibria is the set of 
fixed points of this correspondence. 

To characterize this set, first note that if 
Il < x then (1)-(3) imply that V,, < V,, which 
implies that the best response is 7 = 0. In- 
tuitively, if money is being accepted with a 
lower probability than a barter offer, then it 
is harder to trade using money than barter, 
and so the best response is never to ex- 
change a real commodity for money. Sec- 
ond, if Il > x, then (1)-(3) imply that V,, > 
V,, which implies 7 =1. If money is being 
accepted with a greater probability than a 
barter offer, then it is easier to trade using 
money, and so the best response is to ex- 
change a real commodity for money when- 
ever possible. Finally, if Il = x, then (1)-(3) 
imply that V,,.=V,, which implies that 7 
can be anything in [0,1]. If monetary ex- 
change and barter are equally easy then 
traders are indifferent between having 
money and real commodities, and they could 
accept money with any probability. Based 
on these results, the best-response corre- 
spondence is as shown in Figure 2, and 
there are exactly three equilibria: II = 0, 
I] = 1, and Il = x. 

The equilibrium with II = 0 will be called 
the nonmonetary equilibrium. In this case, 
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agents expect that money will be valueless, 
so they never accept it, and this expectation 
is self-fulfilling. The equilibrium with II = 1 
will be called the pure-monetary equilibrium. 
In this case, agents expect that money will 
be universally acceptable, and so they al- 
ways take it, and this expectation is self- 
fulfilling. Finally, the equilibrium with I] = x 
will be called the mixed-monetary equilib- 
rium. In this case, agents are indifferent 
between accepting and rejecting money as 
long as other agents take it with probability 
Il =x, and so partial acceptability can also 
be self-fulfilling. Alternatively, a symmetric 
mixed-strategy equilibrium where all agents 
accept money with probability x could be 
reinterpreted aS a nonsymmetric pure- 
strategy equilibrium, where a fraction x of 
agents accept money with probability 1 while 
the rest accept it with probability 0.° 


II. Welfare 


The first thing we want to do in this 
section is to compare utility across the vari- 
ous equilibria, for a given value of M. For 
the purpose of this comparison, we keep 
things tractable by restricting attention to 
the limiting case where a >. In this case 
production is instantaneous, and so all 
agents are either money traders or commod- 
ity traders: N,, = M, N,=1-—M, and p= 
M. This makes it relatively easy to solve 
(1)-(3) for the reduced-form payoffs: 


(8) WV, + M)x]}} 
(9) rV,=wW{rIl+ M)x]}} 


where # =(U— M)x /(r + BxID). 
We can now substitute I1=0, Il= x, and 
II =1 into (8) and (9) and compare utility 
across equilibria for commodity traders and 
money traders. 


8There can exist non-steady-state equilibria in this 
model where the probability that money is accepted 
varies over time. An example of a “sunspot equilib- 
rium,” in which the probability that money is accepted 
fluctuates randomly over time even though the funda- 
mentals are nonstochastic and time-invariant, is con- 
structed in Kiyotaki and Wright (1990). 
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If we let the superscripts N, M, and P 
refer to the nonmonetary, mixed-monetary, 
and pure-monetary equilibria, respectively, 
then we have the following results. First, 
commodity traders are equally well off in 
the nonmonetary and mixed-monetary equi- 
libria and strictly better off in the pure- 
monetary equilibrium: = VM <V,". Sec- 
ond, money traders are strictly better off in 
the pure-monetary equilibrium than in the 
mixed-monetary equilibrium and_ strictly 
better off in the mixed-monetary equilib- 
rium than in the nonmonetary equilibrium: 
VN <V.M<V,?. Thus, given the initial en- 
dowment of money and real commodities, 
all agents are at least weakly better off if 
money is acceptable than if it is not, and all 
agents are strictly better off if money is 
universally acceptable than if it is only par- 
tially acceptable.” 

The next thing we want to do is to exam- 
ine how utility varies with M, and for the 
purpose of this comparison we return to 
the general case where a need not be ~. 
The experiment we consider is to increase 
the number of agents initially endowed with 
money and to reduce the number initially 
endowed with real output, so that we can 
maintain the tractability of the unit-inven- 
tory assumption. In either the nonmonetary 
equilibrium or the mixed-monetary equilib- 
rium, all agents are better off the lower is 
M. The reason is that, in these equilibria, 
money does nothing to ameliorate the 
double-coincidence problem, and so it is 
better to endow everyone with real con- 
sumption goods rather than intrinsically 
worthless paper or seashells. The more in- 
teresting case is the pure-monetary equilib- 
rium, where fiat currency does have a gen- 
uine role to play in facilitating exchange. 


°’These results differ from those in Kiyotaki and 
Wright (1990), where we assumed that agents initially 
endowed with fiat currency would freely dispose of it 
and produce a new commodity in the nonmonetary 
equilibrium. This made the initial stock of real com- 
modities differ across monetary and nonmonetary equi- 
libria and therefore made welfare comparisons am- 
biguous. Following Aiyagari and Wallace (1992), we 
assume here that agents initially endowed with fiat 
money cannot produce until they consume, which keeps 
the initial stock of output constant across equilibria. 
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To pursue this, define the welfare crite- 
rion 


(10) W=NVotNV,+ 


This can be interpreted as the ex ante ex- 
pected utility of all agents before the initial 
endowment of money and output is ran- 
domly distributed among them. After some 
algebra, (10) can be simplified to yield 


(11) rW =(U-e)ga/(at¢) 


where = g(u,II) was defined in (6) 
above.'” Now consider maximizing W with 
respect to M. Since W is increasing in ~, we 
proceed by finding the value u° that maxi- 
mizes » with respect to w, and then deter- 
mine the optimal value M° from the 
steady-state condition (5), M= ap /(a + ¢). 

The result is as follows: if x>4 then 
u°=0, which implies M°=0; if x <4 
then =(1-—2x)/(2—2x), which implies 
M° > 0. Intuitively, when x >4, each agent 
is willing to consume (and therefore accept) 
at least half of the commodities produced in 
the economy, and pure barter is not very 
difficult. In this case the role for a medium 
of exchange is not very important, and it is 
optimal to endow everyone with real output 
and no one with money. When x <5, on the 
other hand, pure barter is sufficiently dif- 
ficult that the introduction of some fiat 
money improves welfare, in spite of the fact 
that, in the experiment under consideration, 
endowing some agents with money requires 
endowing fewer agents with real output at 
the initial date. We also note that M° is 
decreasing in x, and that M°>4 as x > 0. 
Thus, as x shrinks and the doubie-coinci- 
dence problem becomes more difficult it is 
optimal to endow more agents with money. 

We now turn to a version of the model in 
which money can be interpreted as being 
divisible rather than indivisible and investi- 
gate the welfare implications of a particular 
mechanism for determining the price level, 


~ Notice that rW equals the difference U —« times 
aggregate consumption, since g is consumption per 
trader per unit time and a/(a+g)=N,+N,, is the 
number of traders. 
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P. We look for a pure-monetary equilibrium 
in which each money trader carries P units 
of cash and all P units are required to 
purchase one real commodity. Then real 
balances are given by M=C/P for any 
exogenous stock of nominal currency C. Of 
course, to make P endogenous we need to 
impose an additional equilibrium condition. 
Consider the method used by Diamond 
(1984) in his cash-in-advance search model, 
which is to impose as an equilibrium condi- 
tion that the gains from trade for a com- 
modity trader and a money trader are 
equalized whenever an exchange is made 
between them: 


(12) 


m 


The left-hand side is the gain from trade for 
a commodity trader who accepts money, 
while the right-hand side is the gain for a 
money trader who acquires one of his con- 
sumption goods. 

If a pure-monetary equilibrium satisfies 
condition (12), we call it a split-the-surplus 
equilibrium. Notice that both sides of (12) 
depend on uy. If wu is large, there are many 
money traders and few commodity traders, 
so having money is not very desirable; hence, 
a commodity trader who acquires money 
gets a smaller gain than a money trader who 
acquires one of his consumption goods. 
Thus, for large w the right-hand side of (12) 
exceeds the left-hand side, and we need to 
reduce pw until either (12) holds or we hit 
uw =0. Inserting the reduced-form payoff 
functions and simplifying, the unique value 
of u that satisfies (12) is given by 


(13) =(1-2x)/(2—2x) 


+" x). 


If r<Px(1-—2x) then p*>0, which im- 
plies a unique M* > 0 satisfying (5), and a 
finite equilibrium price level P* =C/M*. 
If r > Bx(1—2x), then (12) cannot be satis- 
fied for any value of u > 0. In this case, we 
say that the split-the-surplus equilibrium 
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FiGurE 3. VALUE FUNCTIONS AND WELFARE 


entails M* =0 and P =~, and hence there 
can be no monetary exchange. 

Recall that the value of uw that maximizes 
W is w° =(1—2x)/(2—2x), and comparing 
this with (13) we find u° > u*. This means 
the split-the-surplus equilibrium yields a 
lower value of uw, and hence a lower value 
of M and a higher value of P for any given 
C, than that which maximizes ex ante util- 
ity. However, the split-the-surplus equilib- 
rium is still ex post Pareto optimal. To see 
why, consider the value functions V,, and V, 
(V, is proportional to V, and need not be 
considered independently). One can show 
that, as functions of M, both are concave, 
and V, is increasing but V,, is decreasing at 
M = M* (see Fig. 3). Hence, any movement 
away from M* that makes commodity 
traders better off makes money traders 
worse off, and vice versa. The split-the- 
surplus equilibrium is therefore ex post 
efficient even though it fails to maximize 
ex ante welfare."! 


'l Figure 3 indicates that both commodity and money 
traders prefer a lower value of M than that which 
maximizes W. This ostensibly paradoxical result can be 
understood by noting that the number of agents of 
each type varies with M. 


1M 
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III. Specialization 


An insight dating back at least to Adam 
Smith (1776) is that specialization is limited 
by the extent of the market and that the use 
of money encourages specialization by en- 
larging the extent of the market. As Smith 
puts it: 


When the division of labour has been 
once thoroughly established, it is but a 
very small part of a man’s wants which 
the produce of his own labour can 
supply. He supplies the far greater 
part of them by exchanging that sur- 
plus part of the produce of his own 
labour, which is over and above his 
own consumption, for such parts of 
the produce of other men’s labour as 
he has occasion for. Every man thus 
lives by exchanging, or becomes in 
some measure a merchant, and the 
society itself grows to be what is prop- 
erly a commercial society. 

But when the division of labour first 
began to take place, this power of 
exchanging must frequently have been 
very much clogged and embarrassed in 
its operations. One man, we shall sup- 
pose, has more of a certain commodity 
than he himself has occasion for, while 
another has less. ... But if this latter 
should chance to have nothing that 
the former stands in need of, no ex- 
change can be made between them. 

In order to avoid the inconve- 
niency of such situations, every pru- 
dent man in every period of society, 
after the first establishment of the di- 
vision of labour, must naturally have 
endeavored to manage his affairs in 
such a manner, as to have at all times 
by him, besides the peculiar produce 
of his own industry, a certain quantity 
of some one commodity or other, such 
as he imagined few people would be 
likely to refuse in exchange for the 
produce of their industry. 

[1937 pp. 22-3] 


Smith is suggesting that specialization, 
while it may have desirable consequences in 
terms of productivity, makes barter difficult. 
Whenever they can, specialized producers 
will therefore tend to sell their output for a 
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more generally acceptable medium of ex- 
change, which they will then use to buy 
whatever consumption goods they desire. 
Hence, specialization leads to a greater role 
for money, while at the same time the use 
of money affords a greater opportunity for 
specialization by facilitating the process of 
exchange. In order to formalize this, we 
introduce a trade-off between productivity 
and marketability by assuming that the ar- 
rival rate in the production process is a 
function of the number of agents willing to 
consume the output: a = a(x), where a’ < 0. 
The idea is that, by becoming more special- 
ized, a producer can increase output per 
unit time, a, but only at the cost of reducing 
the fraction of consumers who will accept 
his output in exchange, x.!? 

Before entering the production process, 
agents choose x, taking as given the behav- 
ior of others. If money is accepted with 
probability I] and other producers’ deci- 
sions imply that a given individual can con- 
sume a fraction X of their output, his pay- 
off if he chooses x is described by 


(14) =a(x)[V,(x)- Vo] 
(15) V(x) 
= Xx[U-e +V)-V(x)] 


+ BuxlI[V,, 


(16) 


The choice of x will be made by a producer 
to maximize the right-hand side of (14), and 
this x is then carried over to the exchange 
sector as a state variable. As was the case 
earlier, this best-response problem also de- 
pends on p, but p will be determined below 


One interpretation is that each consumer derives 
utility from a fixed set of characteristics embodied in 
some commodities, and a larger value of x implies that 
the producer’s output contains a greater number of 
characteristics and hence has a larger potential market. 
Eduardo Siandra (1990) has independently developed a 
very similar model. Robert King and Charles Plosser 
(1986) and Harold L. Cole and Alan C. Stockman 
(1992) provide other analyses of the interaction be- 
tween money and specialization. 
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using the steady-state conditions as a func- 
tion of strategies and M. 

Equations (14)-(16) can be solved for Vp, 
V,, and V,,. In particular, after simplifica- 
tion, we find that 


(17) a(x)[V,(x)- 
= pxa(x)/f(x) =Z(x) 


where 
é(x)=[r+ BU pw) XT + + Buxll) 


and ¢ does not depend on x. The individ- 
ual choice of x can be found by maximizing 
Z(x). If we assume an interior solution, the 
first-order condition Z'(x)=0 can be rear- 
ranged to yield:'° 


xa'(x) 


r+a(x) 
r+ B(1—p) Xx + Buxll 


It can be shown that the second-order con- 
dition Z” < 0 holds if we assume that a” < 0. 
Then (18) completely characterizes the indi- 
vidual’s choice of x, given X, Il, and uw. We 
write x = x(X,y, II). 

For a symmetric equilibrium we must have 
X =x, or X = x(X,y,I1). Another equilib- 
rium condition comes from the steady-state 
equation M=ayp /(a+¢). Since the right- 
hand side of this equation depends on X, 


'3One can also maximize the right-hand side of (14) 
directly by setting 
a’(x)[V(x)- Yo] + a(x)Vi{(x) =0 
where V;(x) is, from (15), 
Vi(x)= rV,(x)/{ rx + 


Manipulating these equations yields the same first- 
order condition as in the text, equation (18). 
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mw, and II, we write it as M = M(X,y, II). 
Given M, an equilibrium is a solution to 
X = and M=MC(X,y,I1) with 
either I1=0, X, or (since, for 
any given X, the model has a nonmonetary 
equilibrium, a mixed-monetary equilibrium, 
and a pure-monetary equilibrium, exactly as 
in the model without endogenous special- 
ization). In Figure 4 we draw the locus of 
points in (yu, X)-space satisfying each of 
these conditions. Notice that the M= 
MC(X,,II) curve is upward-sloping, shifts 
to the right as II increases, and goes through 
(M,0). Also, the X = x(X,y,II) curve is 
upward-sloping, horizontal, or downward- 
sloping depending on whether II = 0, X, or 
1, and goes through the same intercept 
(0, X,) in any case. The intersection of these 
two curves determines the equilibrium val- 
ues of w and X in the nonmonetary equilib- 
rium, the mixed-monetary equilibrium, or 
the pure-monetary equilibrium, depending 
on whether II = 0, Il = X, or [1 =1."* 
Again let the superscripts N, M, and 
P represent the nonmonetary, mixed-mone- 
tary, and pure-monetary equilibria. As can 
be seen from the diagram, specialization is 
greatest in the pure-monetary equilibrium, 
lower in the mixed-monetary equilibrium, 
and lowest in the nonmonetary equilibrium: 
X?<X™< XN. The intuition behind this 
result is that when money circulates there is 
less of an advantage to having a high value 
of x, since it does not necessarily require a 
double coincidence of wants in order to 
exchange. We can also ask how specializa- 
tion depends on M. An increase in M shifts 
the M = MCX, y, II) curves to the right but 
does not affect the X = x(X,y,II) curves. 
As can be seen from the diagram, when M 
increases, the result is an increase in X in 
the nonmonetary equilibrium, a decrease in 


'4Notice that the pure- and mixed-monetary equilib- 
ria must be unique, but since Il = 0 implies that both 
the X= x(X,y,II) curve and the M=M(X,y,T) 
curve are upward-sloping, they could irtersect more 
than once, and there could be more than one nonmon- 
etary equilibrium. Although examples with multiple 
nonmonetary equilibria can be constructed, we rule 
this out in the following discussion. 
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Ficure 4. M anp X Curves, WHOSE INTERSECTIONS DETERMINE 
EQUILIBRIUM VALUES OF 4 AND X 


X in the pure-monetary equilibrium, and no 
change in the mixed-monetary equilibrium. 
Roughly speaking, an increase in M in the 
pure-monetary equilibrium encourages spe- 
cialization because producers can more eas- 
ily market their specialized output when 
there is more money in circulation.'° 
Consider now the effect of increasing the 
arrival rate in the exchange sector, 8, which 
can be thought of as reducing the frictions 
associated with trade (or “increasing the 


'SThe discussion of the effect of an increase in M 
takes the real money supply to be exogenous, say, 
because the monetary object is indivisible. Alterna- 
tively, we can assume that money is divisible and deter- 
mine the level of real balances endogenously in pure- 
monetary equilibrium, given nominal balances, using 
the split-the-surplus condition discussed above. In 
terms of Figure 4, we need to shift the M = MCX, yn, 1) 
curve until the gains from trade for commodity traders 
and money traders are equalized. One can show that 
there exists a unique split-the-surplus equilibrium, and 
it implies a finite price level under appropriate param- 
eter restrictions, as in Section II. 


extent of the market”). In any pure-mone- 
tary equilibrium, an increase in B shifts the 
X = x(X,u,1) curve down and shifts the 
M(X,u,1) curve to the right, resulting 
decline in X and therefore an increase 
specialization and productivity. As B >, 
x—0, and specialization becomes com- 
plete. As this happens, barter becomes ex- 
tremely difficult, and the ratio of the volume 
of barter to monetary exchange vanishes.'® 
In the limit, agents almost always sell their 
production goods for money and use money 
to buy their consumption goods; as Robert 
W. Clower (1965) puts it, “money buys goods 
and goods buy money; but goods do not buy 
goods.” In this model, however, there is no 
constraint that agents must use cash. To 


rate of barter exchange is B(1— while 
the rate of monetary exchange is Bu(1 — )x. The ratio 
of these two is (1— w)x /p, which vanishes as x > 0 
(note that uw is bounded below by M). For any finite B, 
however, there will always be some direct barter in 
equilibrium. 


— 
xP 
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the contrary, it is because the economy has 
settled on the use of a generally acceptable 
currency that specialization becomes prof- 
itable, and it is specialization that inhibits 
barter. 


IV. Dual Currency Regimes 


In this section we take up the possibility 
of multiple fiat monies. It is motivated by 
the observation that, in some economies, 
there seems to be more than one type of 
currency in simultaneous circulation. For 
instance, it is possible in certain locations to 
have both a domestic currency and a foreign 
currency used in exchange, although per- 
haps the former is generally acceptable while 
the latter is only partially acceptable. One 
example is that Canadian dollars are often 
accepted just across the U.S. border, and 
vice versa, although the foreign currencies 
are not always accepted by domestic resi- 
dents. Furthermore, this situation can per- 
sist even if the two currencies differ in terms 
of rates of return or other intrinsic proper- 
ties. 

In order to study the phenomenon of 
dual-currency equilibria, we assume that 
there are now two colors of fiat money: red 
and blue. To simplify the presentation as 
much as possible, we only consider the case 
in which specialization is exogenous, and we 
assume that both monies are indivisible. If 
we endow all agents with either one unit of 
red money, one unit of blue money, or one 
real commodity at the initial date, then all 
agents will always hold one and only one of 
these objects at all future dates as well. We 
give the monies potentially different intrin- 
sic properties by letting yp and y, denote 
flow yields or dividends; that is, each money 
yields y; “utils” to its bearer per unit time 
(if y,;<0 then it can be thought of as a 
storage cost). Also, let the supplies of the 
two monies be Mz, and Mg, with M,+ 
M, <1, let wp and py, be the proportions 
of traders with red money and blue money, 
and let uc = 1— Wp — My be the proportion 
of traders with real commodities. 

To formulate the representative individ- 
ual’s best-response problem, let the proba- 
bilities of random commodity traders ac- 
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cepting red money and blue money be Il, 
and II,. Then Bellman’s equations are de- 
scribed by 


(19) My =a(V,-Vo) 


(20) 


+ 
TR 

+ Bugx — Vj) 
TB 


(21) 


(22) +Vy— Vz). 


These depend on pp and pp, but as above, 
the steady-state conditions can be solved for 
unique values of wp and pp, given strate- 
gies and exogenous values of Mz and M,."” 
Our goal is to construct an equilibrium in 
which both monies circulate, but with dif- 
ferent acceptabilities: 1 = II, > Il, > 0. This 
requires Vp >V,=Vy,. Now, Vp>V,_ fol- 
lows immediately from II, =1. Further- 
more, for the case in which yp = y,=0, 
(20)-(22) imply that V, = V, if and only if 


(23) 


where 
BxpR). Notice that A>1. If I, =1 and 
Il, = Ax, we have an equilibrium in which 
red money is universally accepted while blue 
money is only partially accepted. By conti- 
nuity, we can perturb yp and y, without 
destroying the equilibrium, as long as ly,| 
and |y,| are not too great. In particular, we 
can construct equilibria with 1=II, >I, 
even though yp <y x. In such an equilib- 
rium, both monies circulate, but the high- 
return asset is less acceptable or less liquid 


'’The way we write Bellman’s equations implicitly 
assumes that agents never trade one currency for an- 
other, which is true in equilibrium because such a 
trade could not possibly make both agents better off. 


3 || Il, = Ax 
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than the low-return asset. That is, the rea- 
son why red money is universally accept- 
able, even though it is dominated in rate of 
return, is that it has liquidity value. If the 
spread yg — Yr becomes too big, however, 
this equilibrium can no longer exist.!® 


VI. Conclusion 


We have presented a model of exchange 
in which the difficulty of pure barter leads 
to a transactions role for fiat currency, and 
we have used the model to address several 
issues in monetary economics.'? Other ap- 
plications can also be studied in this frame- 
work. In Matsuyama et al. (1993), an ex- 
plicit two-country version of a model similar 
to the one presented here is considered and 
used to investigate some issues relating to 
international monetary theory. In Steve 
Williamson and Wright (1991), a “lemons” 
problem is introduced into the model and 
used to illustrate the role of fiat currency in 
helping to overcome the frictions associated 
with private information. Siandra (1990) 
considers further the relationship between 
specialization and monetary exchange. 
Victor E. Li (1991) pursues some issues 
relating to externalities, welfare, and policy. 
Ramon Marimon et al. (1990) use a related 
model to analyze learning. Aiyagari and 
Wallace (1992) consider several other appli- 
cations. Although there are many unan- 
swered questions and much work remains 
to be done, we think that these search- 
theoretic models have definitely enhanced 
our understanding of the exchange process, 
in general, and of money, in particular. 


'8Similar arguments can be used to show that there 
are equilibria in which both monies are universally 
acceptable even though one has a higher rate of return, 
and equilibria in which one money circulates but the 
other does not even though one or the other has a 
higher rate of return. Examples with two circulating 
fiat currencies have also been constructed by Aiyagari 
and Wallace (1992). 

In order to focus on more substantive issues, we 
have neglected many of the technical aspects of 
search-based exchange models, like the possibility of 
multiple dynamic equilibria, including sunspot and 
cyclical equilibria (see Kiyotaki and Wright, 1990; 
Michele Boldrin et al., 1991; Timothy J. Kehoe et al. 
1991). 
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APPENDIX A 


Here we sketch a version of the model 
that makes it precise why agents need to 
trade, without assuming that they cannot 
consume their own output. This is perhaps 
more satisfying, but it does entail an in- 
crease in notation. The implications of this 
version of the model are essentially the same 
as those described in the text. For simplic- 
ity, we consider only the case where a =, 
but it should be clear how to handle the 
more general model. 

Suppose there are K types of agents, with 
equal numbers of each type, and K com- 
modities, where K > 3. Agents are special- 
ists in production but generalists in con- 
sumption, in the following sense. Each agent 
can produce only some of the commodities 
—to ease the presentation, suppose each 
type can produce exactly one commodity— 
but has a need to consume different things 
at different points in time. In particular, 
after consuming one commodity a consumer 
realizes a taste or need for a new commod- 
ity drawn at random. That is, the probability 
that the new commodity will be j is 1/K for 

y. ,K. A consumer with a need 
for commodity j gets utility U from consum- 
ing it and no utility from anything else, at 
least until j is consumed and a new taste 
shock is realized. 

Consider a representative agent. After 
consumption, with probability 1/K he needs 
the commodity he can produce, consumes 
immediately, and draws a new taste shock, 
while with probability (K —1)/K he needs 
something else and must attempt to acquire 
it through trade. Therefore, the expected 
value of drawing a new taste shock, say V,,, 
will satisfy V, =(U+V,)/K —-1)/K. 
Eventually, our representative agent needs 
something that he cannot produce. When 
he meets a potential trading partner, the 
probability that this partner needs the good 
our representative agent produces is 1/K, 
while the probability that this partner also 
has the good our representative agent needs 
is 1/(K —1), since he must have one of the 
commodities other than the one that he 
himself needs. Hence, the probability of a 
double coincidence is 1/ K(K — 1). This im- 
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plies that the value functions for commodity 
and money traders satisfy 


M) 
BM 
(Vn— Vi) 
1—M 


The rest of the analysis is the same as the 
model in the text. 


APPENDIX B 


Here we show how to relax the assump- 
tion of a zero transaction cost on accepting 
money, where, for simplicity, we again con- 
sider only the case where a=. Let 7 
denote the disutility cost of accepting cur- 
rency, and assume that 0 <7<U-—e. The 
value functions in this generalized model 
satisfy 


rV, = M)x?(U-e«) 
+ BMx7(V,, 1) 


rV,, = B(1- 

The only difference from the model in the 
text is that the gain to accepting money is 
now V,,—V,-— 17. As before, there is always 
a nonmonetary equilibrium with II =0. 
There is a pure-monetary equilibrium if and 
only if V,—V,-—72=0 when evaluated at 
Il=1. Manipulating the above equations 
implies that this is true if and only if » <7, 
where 


M)x(1-x)(U-e) 
r+Bx(1-M) 


7 


Finally, there is a mixed-monetary equilib- 
rium if and only if V,.—V,—7=0 when 
evaluated at Il = €(0,1). Manipulating 
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the above equations implies that V,, —V, — 
7 = 0 if and only if 


M)x*(U-e) 
M)x(U-e-7) 


M 


Notice that 11M > 0 and II™ <1 if and only 
if <7. 

Hence, there exists the same set of three 
equilibria for any 7 in (0,7). In terms of 
Figure 2, an increase in 7 shifts the best- 
response correspondence to the right. For 
7 <7 there are still three intersections with 
the 45° line. For 7 > 7 there is only one, at 
Il = 0. Since 7 > 0, we can accommodate a 
positive transaction cost on money. Further- 
more, 7 can exceed ¢ (for example, as long 
as U is sufficiently large), and we can even 
accommodate a transaction cost on money 
that exceeds the transaction cost on real 
commodities. We can also use the split-the- 
surplus condition to determine M* and P* 
= M*/C endogenously for any given stock 
of nominal balances C. It can be shown that 
M* > 0, and therefore P* <~, if and only if 
—-&)/(r + Bx), which 
generalizes the condition for a finite price 
level given immediately after (13). 
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Monetary Policy and Credit Conditions: 
Evidence from the Composition of External Finance 


By Antu K Kasnyap, JEREMY C. STEIN, AND Davip W. Witcox* 


In this paper, we use the relative moments in bank loans and commercial paper 
to provide evidence on the existence of a loan-supply channel of monetary-policy 
transmission. We find that tighter monetary policy leads to a shift in firms’ mix 
of external financing: commercial paper issuance rises while bank loans fall. 
This suggests that contractionary policy can indeed reduce loan supply. Further- 
more, such shifts in loan supply seem to affect investment, even controlling for 
interest rates and output. (JEL E44, E52, G32) 


How does monetary policy affect the 
economy? The textbook /S-LM model holds 
that monetary policy operates through the 
liability side of banks’ balance sheets: when 
the Fed tightens policy by draining reserves 
from the banking system, the noteworthy 
consequence is a fall in the stock of money. 
An alternative view is that independent ef- 
fects come from the asset side of banks’ 
balance sheets (i.e., from bank loans). Ac- 
cording to this view, there are some borrow- 
ers for whom nonbank sources of credit do 
not represent a perfect substitute for bank 
loans. Therefore, when tight monetary pol- 
icy shrinks the size of the banking sector, it 
reduces the overall supply of loans to these 
“bank-dependent” borrowers. As a result, 
investment and aggregate demand fall by 


*Kashyap: Graduate School of Business, University 
of Chicago, Chicago, IL 60637, and National Bureau of 
Economic Research; Stein: MIT-Sloan School of Man- 
agement, Cambridge, MA 02139, and National Bureau 
of Economic Research; Wilcox: Federal Reserve Board, 
Washington, DC 20551. The authors are grateful to 
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national Financial Services Research Center, and Bat- 
terymarch Financial Management for financial and re- 
search support. The views in this paper are solely those 
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Board of Governors of the Federal Reserve System or 
its staff. 


more than can be accounted for by the 
conventional money channel.! 

A substantial body of empirical work seeks 
to distinguish between the “money” and 
“lending” theories of the transmission 
mechanism. Much of this work focuses on 
comparing the correlation between output 
and money to the correlation between out- 
put and loans.” This approach has its limita- 
tions, though, since it does not explicitly 
deal with the issue of endogeneity.? For 
example, a finding that output is more 
closely correlated with money than with 
loans does not necessarily mean that money 
is more important in a causal sense. It could 
instead be that money itself is endogenous 
and responds more strongly than do loans 
to exogenous output shocks. 

Two recent papers revisit this issue. 
Bernanke and Blinder (1992) and Christina 
Romer and David Romer (1990) both iden- 
tify what are arguably good indicators of 
exogenous shifts in Federal Reserve policy. 


'The idea that there may be an important lending 
channel has a long history. Early work on the subject 
includes James Tobin and William Brainard (1963) and 
Brainard (1964). For a recent exposition of the distinc- 
tion between the money and lending channels of policy 
transmission, see Ben Bernanke and Alan Blinder 
(1988). 

*For example, see Stephen King (1986), who exam- 
ines whether money is a better forecaster of future 
economic activity than lending. 

>Benjamin Friedman (1982, 1983, 1986) is an excep- 
tion in that he does a number of tests aimed at estab- 
lishing the exogeneity of his credit proxies. 
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The papers then go on to examine the im- 
pulse responses of money, loans, and vari- 
ous measures of economic activity to these 
indicators. Although different indicators are 
used (Bernanke and Blinder look at the 
federal funds rate and its spreads to Trea- 
suries, while Romer and Romer rely on 
their reading of the minutes of Federal 
Open Market Committee [FOMC] meetings 
to identify shifts to tighter policy), many of 
the qualitative conclusions are similar. When 
the Fed tightens, the money stock falls al- 
most immediately. Bank loans fall also, but 
with a significant lag: the decline does not 
begin to show up for 6-9 months. Output 
falls with a lag also and indeed seems to 
move roughly contemporaneously with 
loans. 

However, even if the indicators used in 
these papers are indeed good proxies for 
exogenous impulses to mouetary policy, an 
identification problem still remains in terms 
of deciding whether a distinct lending chan- 
nel is at work. Just because a fall in output 
coincides with a fall in loans does not estab- 
lish that the former was caused by the lat- 
ter. It is possible that the entire output 
response to the policy tightening was due to 
the conventional money channel and that 
the fall in the quantity of loans simply re- 
flects a decrease in loan demand (due to 
reduced output) and not a reduction in loan 
supply. 

In this paper, we bring new evidence to 
bear that allows for a clearer econometric 
identification of the lending channel of 
monetary-policy transmission. Rather than 
just looking at how bank assets and liabili- 
ties respond to policy impulses, we also 
focus on the behavior of an important sub- 
stitute for bank finance, namely commercial 
paper. The intuition for why such data can 
aid in achieving identification is simple. 
Suppose that monetary policy operated 
solely through a money channel, and that 
the fall in bank loans seen when the Fed 
tightens is due only to an output-induced 
effect on credit demand. Then one should 
expect the demand for nonbank sources of 
credit to decline as well, leading to a reduc- 
tion in, say, the volume of commercial pa- 
per issues. If, on the other hand, Fed tight- 
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ening reduces the supply of bank credit, one 
might expect an increase in commercial pa- 
per issuance, to the extent that businesses 
have some ability to substitute between the 
two sources of finance. 

In the next section of the paper, we for- 
malize the links between the stance of mon- 
etary policy and the composition of external 
finance with the aid of a simple model. The 
model highlights two basic conditions that 
are required for the existence of a lending 
channel and illustrates how data on non- 
bank sources of finance can be used to test 
each condition. Section II contains the first 
set of tests, and Section III contains the 
second set. 

In addition to being informative about 
the transmission of monetary policy, our 
results also shed new light on a statistical 
finding that has attracted a great deal of 
attention in macroeconomics recently, 
namely, that the spread between prime 
commercial paper (CP) rates and Treasury 
bill rates forecasts economic activity surpris- 
ingly well. As noted by Friedman and Ken- 
neth Kuttner (1992) and James Stock and 
Mark Watson (1989), the CP-bills spread 
has been a very powerful leading indicator. 

While the spread’s forecasting power ap- 
pears to be well established, the economics 
of why it has forecast so well are less clear. 
One common interpretation is that the 
spread simply reflects default risk and that 
this forward-looking property is what makes 
it a good predictor. In Section IV, we argue 
against this interpretation. We suggest in- 
stead that the CP-bills spread is a proxy for 
the stance of monetary policy: tight mone- 
tary policy leads to an increase in commer- 
cial paper issuance, which exerts upward 
pressure on paper rates.* If tight money 
eventually has an output effect, this effect 
will have been forecast by the movement in 
the CP-bills spread. 

As we explain in Section IV, the distinc- 
tion between these two hypotheses is a po- 
tentially important one. Our theory leads 
one to be skeptical about the extent to 


*Bernanke (1990) also argues that the spread re- 
flects monetary policy; he notes but does not stress the 
role of commercial paper issuance as we do here. 
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which the historical correlation between the 
spread and economic activity can be ex- 
pected to hold up: as the commercial paper 
market deepens, the price pressure gener- 
ated by a given Fed tightening should de- 
cline. At the same time, our theory suggests 
that data on the volume of commercial pa- 
per issuance may continue to be a useful 
leading indicator. We provide some simple 
forecasting comparisons between interest 
rate spreads and the financing mix which 
are consistent with this view. 


I. The Model 


The model we present below is highly 
stylized. Its aim is not to be either particu- 
larly realistic or rigorous, but rather to illus- 
trate simply the logic behind our empirical 
tests. The model is very similar to the one in 
Bernanke and Blinder (1988), with the one 
key difference being our more explicit focus 
on firms’ capital structure choices. The 
model highlights the conditions under which 
all distinctions between bank loans and se- 
curities can be ignored, as is done in the 
standard [S-LM model where there are only 
two financial assets: “money” and “bonds.” 
When these conditions are not satisfied, 
there are three assets—money, “bank 
loans,” and “securities’—which must be 
accounted for separately.° 

We begin by describing the investment 
and financing behavior of a representative 
firm. The firm invests an amount / and can 
choose between two sources of finance in 
raising the funds for the investment. A frac- 
tion a of the financing comes from bank 
loans at an interest rate r,, and the remain- 
ing fraction (1— a) comes from commercial 
paper at rate r,. In order to keep the model 
simple, we assume that all markets clear by 
price. However, when we turn to the empir- 
ical work, it will be important to recognize 


*Embedded in this simple version of the model is 
the (unrealistic) assumption that publicly traded corpo- 
rate and government securities are perfect substitutes. 
This is an assumption that we will implicitly be relaxing 
when we discuss the determinants of the spread 
between commercial paper and Treasury bills in Sec- 
tion IV. 
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that the “true” price of bank loans is imper- 
fectly observable; because of the widespread 
use of nonprice terms of credit (e.g., 
covenants, collateral, or the type of quantity 
rationing discussed in Joseph Stiglitz and 
Andrew Weiss [1981]), variables such as the 
prime rate may be a very noisy measure 
of r;. 

In addition to the direct interest cost, we 
assume that there is a “relationship” benefit 
R to bank borrowing that depends on the 
total amount borrowed from the bank: 


(1) R= If(a) 


where f(-) is an increasing concave func- 
tion. 

We do not explicitly model the sources of 
this relationship benefit here. However, 
there are several theories that can explain 
why firms with an option to do so might 
choose to finance themselves at least par- 
tially with higher-rate bank loans. For one, 
a banking relationship is likely to involve 
more monitoring and hence a lesser degree 
of informational asymmetry between bor- 
rowers and lenders. In this sense, a banking 
relationship can function like an extended 
internal capital market, allowing firms to 
finance investments even when adverse- 
selection problems make it difficult to raise 
funds from the public markets.° A banking 
relationship may also help to reduce the 
costs of financial distress: the free-rider 
problems associated with continued funding 
of a distressed firm are likely to be less 
severe than in the case when creditors are 
all dispersed security-holders. ’ 

Given our assumptions, the firm’s optimal 
financing mix, which we denote by a”, is 
given by 


(2) a* = F(rz— rp) 


For an example of theoretical work on the benefits 
of bank monitoring, see Douglas Diamond (1984, 1991). 
Takeo Hoshi et al. (1991) find empirical support for the 
idea that banking relationships can be valuable, be- 
cause they help to make firms’ investment less depen- 
dent on their internal cash flow. See also Eugene Fama 
(1985). 

7Hoshi et al. (1990) and Stuart Gilson et al. (1990) 
present evidence consistent with this view. 
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where F(-)= f’ '(-) is a decreasing func- 
tion. Equation (2) contains our first impor- 
tant insight. It establishes that any shock 
(e.g., monetary policy) that disturbs the rel- 
ative costs of loans and paper will be re- 
flected in shifts in firms’ financing mix.® 
Thus, even if the “true” r, is difficult to 
observe accurately in practice, we can use 
data on firms’ financing choices to infer 
something about the state of loan supply.’ 

We now close the model in the simplest 
possible fashion. Given the firm’s choice of 
capital structure, its net cost of capital (in- 
clusive of the relationship benefit of bank 
loans), k, is 


(3) k=r,+a*(r;—r,)— f(a*). 


Investment demand is assumed to depend 
on both k and aggregate output Y:'° 


(4) 1=1°(Y,k). 


‘In our formulation, a change in the relative cost of 
loans and paper is the only thing that alters the mix. 
Diamond (1991) presents a model in which the mix can 
be changed due to movements in the relative demand 
schedules for loans and paper. Diamond’s model has 
the feature that R is effectively higher when real 
interest rates are higher, implying that the mix will rise 
in the wake of a monetary contraction. Therefore, the 
considerations in Diamond’s basic model suggest that 
our tests will be excessively conservative; it will be 
harder for us to find the loan-supply effects we are 
looking for. 

There is another reason why looking at quantity 
financing variables can add value relative to using 
interest rate spreads as a proxy for (r, —r,): the ob- 
served rate spread may be affected by other factors 
outside our model, such as changing default probabili- 
ties. For example, it may be that in a recession the 
likelihood of default rises more for the small compa- 
nies who are bank borrowers than for the large compa- 
nies who are commercial paper borrowers. If this is the 
case, the observed rate spread will widen even in the 
absence of any change in the “true” (net of default) 
costs of loans and paper. This sort of measurement 
error could be particularly problematic, as it is likely to 
be correlated with many of the business-cycle variables 
that we will be studying. 

The model is atemporal so investment and the 
capital stock coincide. Using standard adjustment-costs 
arguments, we could introduce dynamics and transform 
the demand for capital into a model for investment. 
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Aggregate output, in turn, is the sum 
of investment and other autonomous de- 
mand:* 


Y=/+G. 


Interest rates on both paper and bank 
loans are determined by monetary policy. 
To keep things as simple as possible, we 
assume that banks are financed completely 
with demand deposits, hold cash to satisfy 
reserve requirements, and hold no excess 
reserves. Thus the quantity of money, M, is 
a sufficient statistic for the stance of mone- 
tary policy.'? Money affects the interest rate 
on nonbank paper through the usual LM- 
type relationship: 


(6) r,=H(Y,M). 


Money can also affect the interest rate on 
bank loans if banks have nondegenerate 
portfolio preferences across loans and pa- 
per as assets on their balance sheets. We 
assume that banks wish to hold a fraction of 
their assets in loans and that this fraction 
depends on the spread between the rate on 
loans and the rate on paper. Thus, loan 
supply L* is given by 


(7) 


where J(-) is an increasing function. Note 
that since we allow banks to cushion the 
impact of monetary policy on loans by carry- 
ing out portfolio adjustments on the asset 
side of their balance sheets, there is no real 
loss of generality in assuming that they are 
inflexible with respect to the structure of 
their liabilities. 

Equating loan supply to loan demand 
yields 


(8) a*1=J(r,—1,)M. 


‘For simplicity, we do not model consumption be- 
havior. 

‘This is an obviously unrealistic assumption, and it 
is made for expository purposes only. Indeed, in the 
empirical work that follows, we use indicators other 
than the quantity of money to gauge the stance of 
monetary policy. 
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The six equations, (2)-(6) and (8), deter- 
mine the six endogenous variables Y, J, k, 
Tg, Tp» and a* in terms of the exogenous 
parameters G and M. It is clear from these 
equations that there are two necessary con- 
ditions that must both be satisfied if mone- 
tary policy is to impact aggregate demand in 
part through a distinct lending channel: 


(i) Loans and paper must be imperfect 
substitutes as bank assets. That is, banks 
must not react to a contraction in size 
simply by reducing their holdings of pa- 
per and leaving loan supply unchanged. 

(ii) Loans and paper must also be imperfect 
substitutes as corporate liabilities. Firms 
must not be able to offset costlessly a 
reduction in bank loan supply by issuing 
more paper. 


The next two sections of the paper are 
devoted to testing each of these necessary 
conditions in turn. In both cases, the model 
helps us to see how data on either the 
volume of nonbank paper or the ratio of 
bank loans to total financing can help in 
creating unambiguous tests. 

To test condition (i), we investigate how 
paper volume and the ratio of loans to total 
financing respond to a monetary-policy im- 
pulse. That this provides a sharper test of 
condition (i) than simply looking at the re- 
sponse of loan volume to a monetary im- 
pulse can be seen by computing the follow- 
ing three derivatives: 


(9) d(loans) /dM 

= a*dI/dM + Ida*/dM 
(10) d(paper)/dM 

= (1—a*) dl /dM — Ida*/dM 
(11) da*/dM = F'd(r,—r,)/dM. 

According to equation (11), the financing 

ratio can vary with money only if condition 
(i) holds. If condition (i) does not hold, and 
loans and paper are perfect substitutes as 


bank assets, then the rates on both will be 
equalized, and a* will not be affected by 
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monetary policy. Thus, looking at how a* 
moves with M can be informative about 
whether condition (i) holds.'? The same 
cannot be said for loans. Equation (9) makes 
it clear that even if there is perfect substi- 
tutability (so that da*/dM =0), loans will 
be positively correlated with money due to 
their positive correlation with the level of 
investment. This is the source of the identi- 
fication problem noted in the Introduction. 

The responsiveness of paper volume to a 
monetary impulse provides an even stricter 
test of condition (i) than looking at a*. As 
can be seen from equation (10), a monetary 
tightening has two opposing effects on pa- 
per volume. On the one hand, it leads to a 
decline in investment that reduces the de- 
mand for all types of financing. On the 
other hand, it may lead to a substitution 
away from loans and toward paper, thereby 
increasing paper volume. Thus the correla- 
tion between paper and money will be nega- 
tive only if the effect of M on a%* is strong 
enough to overwhelm the positive effect of 
money on output and investment. 

Our tests of condition (ii) are motivated 
by the observation that, in a Modigliani- 
Miller world where firms can costlessly sub- 
stitute between bank and nonbank sources 
of finance, their financing choices (as mea- 
sured by a*) should be uninformative about 
investment activity. Thus, we can test condi- 
tion (ii) by examining the relationship be- 
tween a* and various measures of invest- 
ment. 

The rationale for our tests of condition 
(ii) can be seen more formally by consider- 
ing the total derivative of investment: 


(12) dIl=I,dy + I,dr, + I,a*(dr, —dr,). 
In a world where bank loans and paper are 


We also check directly to see how an interest- 
rate-based proxy for r,—r,, (the spread between the 
prime rate and the CP rate) responds to a monetary 
policy shock. To the extent that this proxy is an accu- 
rate one, we would expect to obtain similar results 
using both it and the financing ratio. However, as 
emphasized above, there are various potential 
measurement-error problems associated with using an 
observed rate spread as a proxy for the “true” cost 
differential between loans and paper. 
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perfect substitutes, the third term on the 
right-hand side of (12) drops out, and in- 
vestment depends only on output and the 
security-market interest rate. However, with 
imperfect substitutability, the spread be- 
tween loan and paper costs also affects in- 
vestment. In testing condition (ii), we esti- 
mate investment equations similar in form 
to (12), making use of changes in a* to 
capture changes in the spread between loan 
and paper costs.'* 

Before proceeding, we ought to comment 
on the homogeneity assumption implicit in 
our model, wherein a single representative 
firm faces a meaningful trade-off between 
bank loans and commercial paper. More 
realistically, there will be some heterogene- 
ity: some larger firms may already be fi- 
nancing themselves solely from nonbank 
sources, and some small firms may be un- 
able ever to access the securities markets. 
This leaves only some firms in the middle 
facing a meaningful trade-off. 

Such heterogeneity will probably make it 
harder for us to find the evidence we are 
looking for. To see why, consider the polar 
case in which all firms are either completely 
in or completely out of the paper market, so 
that there are no firms facing a trade-off at 
the margin. In such a case, monetary policy 
will indeed have lending-channel effects—it 
will lead to a credit crunch for bank-depen- 
dent firms—but commercial paper issuance 
will not increase, because the bank-depen- 
dent firms cannot move into the paper mar- 
ket. Thus, heterogeneity cannot “explain 
away” a finding that paper issuance rises in 
the wake of a monetary tightening. If any- 
thing, it makes such a finding all the more 
striking. 


II. Monetary Policy and the Composition 
of External Finance 


In this section, we look at how the vol- 
ume of bank loans and commercial paper 


4a gain, we also use the prime—CP spread as a 
proxy for r,—r, in our investment equations. The 
same caveats discussed above apply. 

This polar version of the model seems to be what 
Bernanke and Blinder (1988) have in mind. 
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outstanding respond to changes in the stance 
of monetary policy. Unfortunately, measur- 
ing the stance of monetary policy is in itself 
not a straightforward task. Given that any 
single measure is imperfect and subject to 
criticism, we use several different monetary- 
policy indicators which have been suggested 
in the literature. 

In our first set of tests, we follow the 
technique of Romer and Romer (1990), who 
read the minutes of the FOMC and identify 
six dates since World War II when the Fed 
appears to have opted for a clear shift to 
tighter policy. These “Romer dates” are: 
October 1947, September 1955, December 
1968, April 1974, August 1978, and October 
1979. Thus, one simple experiment is to 
compare the behavior of commercial paper 
and bank loan volume before and after 
Romer dates.'® An increase in commercial 
paper volume subsequent to a Romer date 
would constitute evidence of a constriction 
in bank loan supply. 

Although intuitively appealing, the Ro- 
mer-date approach, by focusing on only a 
handful of extreme episodes, may sacrifice 
valuable information. To recapture some of 
that information, we also use data from the 
federal funds market. Bernanke and Blin- 
der (1992) argue persuasively that the fed- 
eral funds rate, as well as the spread be- 
tween federal funds and Treasury bonds, 
are good indicators of monetary policy. Our 
second set of tests thus examines the corre- 


'°A couple of questions have been raised concerning 
the use of Romer dates as proxies for monetary policy. 
First, it is difficult to make an airtight case for their 
exogeneity; the Fed may be endogenously responding 
to changes in activity when it opts to tighten policy. 
However, we believe that this presents less of a prob- 
lem for our use of the Romer dates than it might in 
other applications. For example, the endogeneity issue 
might be problematic if one were trying to argue that 
monetary policy has a causal effect on output. If a 
Romer date endogenously tends to follow temporarily 
elevated GNP, it might be incorrect then to infer that 
the tightening causes any subsequent decline in GNP. 
However, we think it is much less likely that Fed policy 
responds endogenously to fluctuations in variables such 
as the financing mix. 

A second issue is the Romer dating criterion’s pc- 
tential omission of other significant episodes of policy 
tightening, such as the 1966 “credit crunch.” We shall 
discuss this issue shortly. 


84 


A. COMMERCIAL PAPER 


THE AMERICAN ECONOMIC REVIEW 


MARCH 1993 


1965 1969 1973 


B. COMMERCIAL BANK LOANS 


1977 


300 


1981 1985 1989 


600 
450 


— 150 


1969 1973 


1977 


1981 1985 1989 
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FicurE 1. THE COMPOSITION OF SHORT-TERM EXTERNAL FINANCE 
(FLow-oF-Funps Data, QUARTERLY, 1963-1989) 


Notes: Solid vertical lines mark Romer dates; the dotted line represents the 1966 
credit crunch. Nominal values are deflated using the GNP implicit deflator. Commer- 
cial paper and bank loan values are in billions of 1982 dollars. 


lation between both of these interest rate 
indicators and the financing variables. 
Figure 1 presents the basic financing data 
that will be analyzed in this section: the log 
of the real amount of commercial paper 
issued by nonfinancial domestic corpora- 
tions (“commercial paper” hereafter); the 


log of the real amount of commercial bank 
loans made to businesses (“bank loans” 
hereafter); and the ratio of the bank loans 
to the sum of bank loans and commercial 
paper (“mix” hereafter). These data, along 
with all the other data used in the paper, 
are described in the Data Appendix. The 
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mix variable is an empirical proxy for the 
quantity a. The solid vertical lines in the 
chart denote the dates identified by the 
Romers as the onset of monetary tighten- 
ing. 

The dashed vertical line marks the begin- 
ning of the 1966 “credit crunch.” Although 
this episode was not singled out by the 
Romers, it is widely regarded as one of the 
most significant periods of tight credit in the 
post-World War II period. For example, 
Philip Cagan (1972) notes that monetary 
growth was flat over the last three quarters 
of 1966. Moreover, he argues that the deci- 
sion made by the FOMC on May 10 of that 
year, to restrict reserve growth until deposit 
growth slowed, represented a_ significant 
tightening of policy. In light of this ambigu- 
ity, the formal statistical tests that we will 
present shortly are done two ways: both 
with and without the 1966 credit crunch 
included in our list of tightening episodes. 

The figure illustrates the phenomenal 
growth of the commercial paper market over 
the last 25 years. As the mix variable shows, 
commercial paper moved from being a triv- 
ial fraction of financing (less than 2 percent) 
to becoming a large source of financing. 

Figure 2 focuses on the movements in 
commercial paper, bank loans, and the mix 
in the year before and three years following 
each of the Romer dates (and the 1966 
credit crunch). For comparability across 
episodes, each series is normalized to equal 
zero as of the date of the monetary tighten- 
ing. For reference, the straight line in each 
graph shows the sample average trend in 
each series. The average across the Romer 
episodes (not including the 1966 episode) is 
given by the bold line in each panel. 

Panel A of Figure 2 confirms that across 
all of the episodes, paper grows at or above 
trend over the first year following the focal 
dates. The 1966 experience is particularly 
noteworthy, as paper volume soared over 
the last half of 1966 and the beginning of 
1967. Interestingly, many analysts studying 
this period explain the boom in commercial 
paper issuance using exactly the sort of logic 
discussed above: they argue that the paper 
market took off when high-quality borrow- 
ers seeking loans were turned away by their 
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bankers. For instance, Timothy Rowe (1986, 
p. 119) reports that during the 1966 credit 
crunch, “many potential borrowers who for- 
merly relied on bank short-term credit were 
forced to turn to the commercial paper 
market. Consequently, the annual growth 
rate of commercial paper outstanding rose 
from 7.8 percent in 1965 to 46.6 percent in 
1966.” 

Returning to the other episodes, the rapid 
growth in paper tends to abate over the 
second and third years following the focal 
date. Panel B of Figure 2 shows that there 
is relatively little action in bank loans imme- 
diately following the Romer dates: as docu- 
mented by Romer and Romer (1990), bank 
loans grow at about trend rates for the first 
few quarters after a monetary contraction. 
Subsequently, loan growth begins to slow. 
Bernanke and Blinder (1992) argue that this 
pattern should be expected since loan com- 
mitments and other implicit arrangements 
prevent banks from quickly shifting their 
portfolios in response to a shock. By two 
full years after each of the Romer dates, 
loan volume is typically below trend.'’ 

The behavior of the mix is as would be 
expected from the behavior of its individual 
components. This can be seen in panel C of 
Figure 2. The mix declines noticeably within 
the first year after a Romer date, and by the 
third year it is still below the trend line. 
Notice again that the 1966 episode stands 
out as a period of dramatic shifting away 
from bank financing and toward commercial 
paper financing. 

In panel A of Table 1, we attempt to 
quantify the statistical significance of these 
effects. We begin by creating two dummy 
variables. The first, labeled the “Romer 
dates” dummy, equals 1 on a Romer date, 
and 0 otherwise. The second, labeled the 
“Romer dates plus 1966” dummy, also 
equals | at the onset of the 1966 credit 
crunch. We then do Granger causality tests 
to see if movements in either of these vari- 
ables help forecast movements in firms’ fi- 
nancing choices. We do the tests in two 


"The average decline in bank loans is largely at- 
tributable to the 1974 episode, as can be seen from 
Figure 2. 


A. COMMERCIAL PAPER 


THE AMERICAN ECONOMIC REVIEW 


MARCH 1993 


0.6 


B. COMMERCIAL BANK LOANS 


0.6 


0.2 


1 2 
1978:3 we 


| 


: 
~1974:2 


0.3 


FIGURE 2. CHANGES IN SHORT-TERM EXTERNAL FINANCE AROUND ROMER DATES 


Notes: All series are shown as log deviations from their values as of the Romer dates. 
Ticks along the x-axis mark quarters. Averages across Romer dates, excluding the 
1966 episode, are shown by the bold lines in each panel. Average growth over 
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basic ways. In the “bivariate” version of the 
tests, we regress the change in the financing 
variable (e.g., the mix) on eight lags of itself 
and on eight lags of the dummy variable. In 
the “multivariate” version, we add eight 
lags of GNP growth to the equation, in an 
effort to control for cyclical factors other 


than monetary policy which might conceiv- 
ably affect the financing variables. Thus for 
each financing variable (mix, paper, and 
loans) we conduct four separate tests. 

The results in panel A of Table 1 indicate 
that many of the patterns displayed in Fig- 
ure 2 are indeed statistically significant, in 
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C. BANK LOANS AS A FRACTION OF TOTAL SHORT-TERM EXTERNAL FINANCE 


Ficure 2. (Continued ) 


TaBLeE 1—Causa.ity Tests BETWEEN FINANCING 


VARIABLES AND INDICATORS 


OF THE STANCE OF MONETARY POLICY 


Mix 


Log real paper 


Log real loans Prime—CP spread 


Indicator 


Exclusion Summation Exclusion Summation Exclusion Summation Exclusion Summation 


A. Romer Date Indicators: 


Romer dates (bivariate) 

Romer dates plus 1966 
(bivariate) 

Romer dates (muliivariate) 

Romer dates plus 1966 
(multivariate) 


B. Interest Rate Indicators: 


Funds rate (bivariate) 0.003 
Spread: 10-year government 


0.03 


0.06 


0.0002 


0.002 
0.0009 


0.02 


— 1.46 


0.001 4.10 


bond-—funds (bivariate) 
Funds rate (multivariate) 
Spread: 10-year government 

bond-funds (multivariate) 


0.14 
0.20 


1.61] 
—6.15 


0.003 
0.002 


— 1.48 


0.007 2.44 


0.04 0.10 2.73 0.99 — 0.48 0.028 


Notes: In each case, the variable at the top of the column is regressed against eight lags of itself and eight lags of the 
variable in the row. In the multivariate cases, eight lags of GNP growth are added to the regression. The following variables 
have been differenced so that they enter the regressions in stationary form: mix, log real paper, log real loans, prime—CP 
spread, and funds rate. Entries in the ‘“‘exclusion’”’ columns report the marginal significance levels on tests that variables 
shown in rows do not help forecast the variable at the top of the column. Entries in the “summation” columns report the ¢ 


statistic for the test that the sum of the coefficients on the eight lags of the variable shown in the row is zero. 


spite of the small number of observations. 
The sum of the coefficients on the dummy 
variable is significantly negative in all four 
of the mix regressions. Thus, even without 
including the 1966 episode, we find that the 
mix falls significantly after a Romer date. 
The results for the individual components 
of the mix (i.e., ioans and commercial pa- 


per) are somewhat less uniform, although 
they too always go in the predicted direc- 
tion. The sum of the coefficients is positive 
in all four of the regressions involving com- 
mercial paper, but the results are statisti- 
cally significant only when the 1966 credit 
crunch is included. In the loan regressions, 
the coefficients are always negative, but they 


87 
0.025 
+ 
0 
. 
0.025 
1979:4 
/ 
1978-3 0.050 
0.09 2.10 0.84 0.17 —2.16 a 2.17 
0.08 — 2.55 0.13 0.07 — 1.99 ma 2.27 
0.02 -2.91 0.57 0.45 — 1.02 2.35 
0.05 3.02 0.07 0.12 —1.18 Ls 1.98 
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are only significant in the bivariate specifi- 
cations. Thus, while loans appear to decline 
in the eight quarters following a monetary 
contraction, this decline is not significantly 
greater than would be expected given the 
weakening of GNP that occurs in this pe- 
riod. 

Finally, we also tested to see whether the 
spread between the prime rate and the 
commercial paper rate widens after the Fed 
tightens. If the sorts of measurement-error 
problems discussed above are not too se- 
vere, one would expect the behavior of this 
spread to be closely related to that of the 
mix. Interestingly, the last column of Table 
1 suggests that this is indeed the case: in all 
four specifications, we observe a statistically 
significant increase in the prime—CP spread 
subsequent to a monetary contraction. 

Next, we turn to the two interest-rate- 
based measures of monetary policy pro- 
posed by Bernanke and Blinder (1992): the 
federal funds rate (Bernanke and Blinder’s 
preferred proxy for monetary policy) and 
the difference between the 10-year con- 
stant-maturity government bond rate and 
the funds rate. We use exactly the same 
format as above, simply replacing the two 
dummy-variable series in the regression tests 
with these two interest-rate series.'* As be- 
fore, we conduct both bivariate and multi- 
variate tests, so that we again have four 
separate tests for each financing variable. 

The results of these tests are reported in 
panel B of Table 1, and they parallel those 
in panel A quite closely. Most notably, the 
sum of the coefficients on the monetary- 
policy indicator is again statistically signifi- 
cant (and of the predicted sign) in all four 
of the specifications involving the mix vari- 
able: the mix falls significantly when either 
the funds rate rises or when the (bond- 
funds) spread shrinks. 


'8We actually used the change in the federal funds 
rate in the regressions, as the funds rate itself is 
nonstationary. Thus, all the tests in panels A and B of 
Table 1 involve differencing every nonstationary vari- 
able. However, we have also rerun all the tests with an 
alternative specification in which the nonstationary 
variables enter in level form, but in which a time trend 
is added to the right-hand side of the equation. This 
alternative specification produces results very similar to 
those we report here. 
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Commercial paper issuance goes up when 
either of the interest rate indicators points 
to tighter monetary policy, although here 
(as in panel A) the statistical significance 
depends somewhat on the specification em- 
ployed. The results for loans are the weak- 
est: they are of marginal significance in the 
bivariate tests and are completely insignifi- 
cant in the multivariate tests. Thus, as in 
panel A, the evidence suggests that the de- 
cline in loans in the first eight quarters after 
a monetary tightening is not much greater 
than would be expected given the weaken- 
ing of GNP that occurs in this period. 

Finally, as in panel A, the results in panel 
B also suggest that the (prime—CP) spread 
widens in the wake of a monetary contrac- 
tion. 

Thus far, we have looked only at the 
commercial paper market for evidence on 
the behavior of nonbank sources of finance. 
While commercial paper may be the closest 
substitute for bank loans, our narrow focus 
on this market nonetheless raises a ques- 
tion: what if the increases in commercial 
paper volume that we document do not 
represent substitutions away from bank 
loans, but rather substitutions away from 
other nonbank sources of finance? If this 
were the case, it would be inappropriate for 
us to interpret our results as evidence of a 
reduction in bank loan supply. 

Ideally, we would tackle this issue by ana- 
lyzing the movements in all major nonbank 
sources of finance. Commercial paper and 
trade credit stand out as the leading short- 
term substitutes for bank loans. Unfortu- 
nately, however, the standard aggregate data 
on gross trade credit is not appropriate for 
our purposes, since gross trade credit does 
not represent a net source of financing to 
the corporate 

Data availability is less of a problem with 
respect to long-term financing instruments. 


The lending view of monetary-policy transmission 
has implications for the pattern of interfirm trade 
credit flows, rather than for the aggregate quantity of 
gross trade credit. For example, if loan supply is con- 
stricted, we might expect to see larger firms with access 
to public debt markets extending more trade credit to 
smaller firms, an argument made by Allan Meltzer 
(1960). Pursuing this issue requires firm-level data and 
is beyond the scope of this paper. 


3 
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We are not aware of any good theoretical 
reasons to believe that a substitution from 
bonds (or other long-term sources of fi- 
nancing) to commercial paper should follow 
a monetary contraction.”” Nevertheless, it 
seems worth checking out this possibility 
empirically. To do so, we looked at new 
issues of high-grade industrial bonds and 
performed tests similar to those done above 
for the other financing variables. The results 
do not support the notion that a monetary 
contraction leads to a swap from bonds to 
paper. For example, we find a positive (al- 
beit statistically insignificant) correlation be- 
tween the federal funds rate and bond is- 
suance. In other words, bonds typically move 
in the same direction as paper after the Fed 
tightens. This strengthens the case for be- 
lieving that the movements seen in commer- 
cial paper volume are due to loan-supply 
effects rather than to relative shifts in the 
demand for alternative sources of nonbank 
finance. 


III. The Composition of External Finance 
and Real Activity 


We now turn to an investigation of 
whether condition (ii)—imperfect substi- 
tutability by corporations between loans and 
paper—is satisfied. Our tests are motivated 
by the observation that in a Modigliani- 
Miller world where businesses can costlessly 
substitute between bank and nonbank 
sources of finance, their financing choices 
should be uninformative about investment 
activity. In contrast, our model suggests that 
data on financing choices should have some 
predictive power for investment. 

Our basic approach is to ask whether the 
financing mix adds significant explanatory 
power to investment equations that already 
include the usual interest rate variables. If 
we did not control for interest rates, our 


For example, a simple expectational model of the 


term structure provides no theoretical guidance as to 
how changes in monetary policy and the accompanying 
interest rate changes should affect firms’ choice of debt 
maturity. 
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results would not be able to verify the exis- 
tence of an independent lending channel of 
transmission. For example, an uncondi- 
tional correlation between the mix and real 
activity could arise if monetary policy moved 
the mix around [i.e., condition (i) was satis- 
fied] but influenced the real economy solely 
through its effect on security-market inter- 
est rates [i.e., condition (ii) failed to hold].*! 

Unfortunately, the empirical success of 
most investment and inventory models is 
open to question. Furthermore, simultaneity 
and feedback relationships tend to make 
the results from reduced-form models such 
as VAR’s difficult to interpret. We proceed 
by examining several “off-the-shelf” struc- 
tural models for inventories and investment. 
The advantage of this approach is that it 
makes the interpretation of the results much 
more straightforward. The cost is that one 
must be more concrete in specifying how 
the economy operates. To avoid being tied 
to one point of view on this question, we 
use a host of models. 

In the case of fixed investment, there are 
several well-known empirical models. We 
examine three: the “accelerator,” the 
“neoclassical,” and the securities-value (or 
“O”) models. Our specifications follow di- 
rectly from the work of Peter Clark (1979), 
who conducts a detailed comparison of these 
models as well as a couple of others. The 
first two models, the neoclassical and the 
accelerator, can be motivated from a 
partial-adjustment mechanism. If we denote 
the desired capital stock by K*, the acceler- 
ator model is derived by assuming that this 
desired level is proportional to the current 
level of output, Y, and that some costs of 
adjustment impede firms from instanta- 
neously equating their actual capital stock 
with their desired stock. Given the slow 


“This is not to imply that unconditional correla- 
tions between the mix and real activity are uninterest- 
ing. Indeed, we examine such correlations in the next 
section and compare them to the well-documented 
correlations between the CP-bills spread and real ac- 
tivity. Our only point here is that unconditional corre- 
lations are inappropriate for drawing a clear distinction 
between money and credit channels. 


¢ 
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adjustment, net investment J will be given 
by 


N 


s=0 


(13) 


N 
B,AY,_, 


s=0 


where A represents the first-difference op- 
erator and N is the length of the dis- 
tributed lag. To model gross investment, J, 
we assume that replacement demand is pro- 
portional to the lagged capital stock. Fur- 
thermore, we follow Clark (1979) in normal- 
izing all variables by potential output, YP, 
to correct for heteroscedasticity, and in as- 
suming that the stochastic disturbance in 
the equation exhibits first-order serial corre- 
lation.2” Therefore the specification of the 
Clark accelerator equation is 


I 


— 


The Jorgenson-style neoclassical model 
that we consider is obtained by postulating 
that the desired capital stock is the follow- 
ing linear function of output: 


15 

(15) 

where p is the price of output and C is the 
rental price of capital services. This formula 
can be derived by assuming that output is 
produced competitively and that the pro- 
duction function is Cobb-Douglas. In this 


2 Without scaling, the residuals from the equation 
seem to be quite heteroscedastic. In a large sample, the 
scaling should only help to correct the heteroscedastic- 
ity and not otherwise affect the equation. We find that 
the scaling not only helps to correct the heteroscedas- 
ticity, but also makes the impact of the mix more 
pronounced in our augmented equations. 
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case, the parameter y is the share of capital 
in output. On theoretical grounds, many 
prefer the neoclassical model over the ac- 
celerator model since it introduces a depen- 
dency between investment and the cost of 
capital. Given the earlier assumptions re- 
garding replacement investment, hetero- 
scedasticity, and serial correlation, the spec- 
ification Clark (1979) analyzes is: 


pY 
I £ 
16) =—~+ | ———— 
5K, _, 
+ +u, 
YP 


U, = pu,_| + €;. 


We also experimented with Clark’s ver- 
sion of Bischoff’s modified neoclassical 
model, which allows for different coeffi- 
cients on output and the cost of capital. The 
results were very similar to the results from 
the simple neoclassical specification and 
thus are omitted. 

Our final model, the Q model, is based 
on the assumption that if the market value 
of a firm exceeds the replacement cost of its 
assets, then it should seek to expand. Our 
specification, again taken from Clark (1979), 
is 


I N 
(17) 


t-1 s=0 


u, = pu,_, + €, 


where Q is the market value of the firm 
(corrected for the effect of taxes) divided by 
the replacement cost of the firm. Together 
these three models roughly encompass most 
of the structural equations that have been 
proposed for studying investment. 

There is even less consensus on how to 
handle inventories. We follow Louis Mac- 
cini and Robert Rossana (1981) and embed 
the role of interest rates in a partial-adjust- 
ment model. As Blinder and Maccini (1991) 


| 
| 
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note, although this strategy has been rela- 
tively unsuccessful in finding interest rate 
effects, it is virtually the only tractable alter- 
native and therefore has been widely used. 
Letting H be the actual stock of inventories 
and H* be the desired stock of inventories, 
Maccini and Rossana begin with 


(18) 


A log( H,) = A[log( H;*) —log( H,_,)] 


(19) log( =u + Blog(S;) + yr? 


where S° and r° are expected sales and 
carrying costs, the factors that are typically 
assumed to be the key determinants of de- 
sired inventories.> Maccini and Rossana 
follow the standard practice of using dis- 
tributed lags to proxy for the expected value 
of sales and carrying costs. Thus, after sub- 
stituting to eliminate the desired target and 
allowing for serial correlation, the equation 
they consider is 


(20) Alog(H,) 


N, 


B, log( S,_,) 


s=1 


+ y,r,_,+ dlog( H,_,)+u, 


s=1 


u,= pu,_| + 


where S and r are actual sales and interest 
costs. We use two variants of this equation. 
The first, which we label the “accelerator” 
model, contains only the terms pertaining to 
expected sales. The second, labeled “neo- 
classical” in what follows, includes both the 
sales and interest-expense proxies. 

Before the equations can be estimated 
several other details must be finalized. First, 
we need to identify the lag lengths that will 


*3Maccini and Rossana (1981) also experiment with 
allowing inventories to depend on other factors such as 
materials costs and wages. Those extensions had little 
effect on the importance of interest rate effects, so we 
suppress them. 
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be used in the estimation. For the invest- 
ment equations we follow Clark (1979) by 
using quarterly data and allowing for 20 lags 
on the output and cost-of-capital measures, 
but restricting the lag coefficients to lie on a 
sixth-order polynomial. In the inventory 
equations we follow Maccini and Rossana 
(1981) by allowing for 24 lags of sales and 20 
lags for the inventory carrying costs and 
restricting these coefficients to lie on a 
third-order polynomial. (Limited experi- 
mentation suggested that the results were 
robust with respect to both of these choices.) 

Second, we must specify which inventory 
and investment series will be used. For the 
investment series, we follow the standard 
convention in the literature and distinguish 
between investment in equipment and in- 
vestment in nonresidential structures. For 
inventories, we follow the distinction made 
in the national income and product ac- 
counts and isolate inventories held by man- 
ufacturers, wholesalers, and retailers of 
nondurable goods and inventories of manu- 
facturers and wholesalers of durable 
goods.”* For convenience, we refer to these 
as nondurables and durables inventories, re- 
spectively. 

Finally, to test whether the mix has any 
additional explanatory power, we added the 
mix to each equation with a specification 
symmetric to that for interest rates. For the 
investment equations this amounts to adding 
20 lags of the mix with the coefficients con- 
strained to lie on a sixth-order polynomial, 
while for inventories this strategy suggests 
allowing for 20 lags of the mix with the 
coefficients constrained to lie on a third- 
order polynomial. (We obtained fairly simi- 
lar results when we instead added four lags 
of the mix to the equations and did not 
restrict the coefficients.) The equations are 
estimated over the 1964-1988 period.” 


°4Thus we are ignoring retail durable-goods inven- 
tories. This category is mostly automobiles and is rela- 
tively small compared to the other two categories we 
consider. 

The sample period was chosen to correspond to 
the availability of the data for Q, which was available 
only through 1988. 
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TABLE 2—TESTs FOR EXPLANATORY POWER OF MIX AND PRIME—CP SPREAD 
IN STRUCTURAL INVENTORY AND INVESTMENT EQUATIONS 


Mix Prime—CP spread 
Neoclassical 


Category Accelerator Accelerator Neoclassical 


Nondurable inventories 0.173 0.009 — 0.022 0.001 
Durable inventories 0.002 0.049 — 0.500 0.302 
Producer’s durable equipment 10° 0.003 0.026 10s? 0.003 
Nonresidential structures 0.460 0.391 0.175 0.537 0.011 


0.027 
0.037 


Notes: The number in each cell is the significance level in an F test that tests whether a distributed lag of either the 
mix or the prime—CP spread enters the inventory and investment equations described in the text. “Nondurable 
inventories” are inventories held by manufacturers, wholesalers, and retailers of nondurable goods; ‘‘durable 


inventories” are inventories held by manufacturers and wholesalers of durable goods. 


In Table 2 we report the significance lev- 
els for the F tests of whether all the coef- 
ficients on the mix are jointly equal to zero.”° 
The results suggest that the mix is an impor- 
tant determinant of inventories; the mix 
shows up very strongly for the durables in- 
ventories and moderately strongly for the 
nondurables inventories. The shapes of the 
lag distributions imply that the mix has the 
expected effect, at least over the first couple 
of years: a declining loan share is associated 
with falling inventories. Given that the long- 
run output effects of monetary policy would 
be predicted to be small, it is also not 
surprising to find that the long-run impact 
of a shift in the mix is insignificantly differ- 
ent from zero. 

These results—that inventories can be in- 
fluenced by financial factors—are particu- 
larly interesting in light of two stylized facts. 
First, interest rate effects have been hard to 
find in standard inventory models. Second, 
there is a very tight positive correlation be- 
tween bank loans and inventories (see e.g., 


26The tests are carried out in terms of the level of 
the mix. The arguments in Jeffrey Wooldridge (1990) 
suggest that standard asymptotics apply, provided that 
the mix, conditional on the other right-hand-side vari- 
ables in the equations is stationary. This condition is 
easily satisfied in all of the specifications aside from the 
Q equations, where it is marginally satisfied. Nonethe- 
less, to ensure that our small-sample results are not 
due to any leftover time trends in the investment series 
that are correlated with the trend in the mix, we reran 
all the tests, adding a time trend to the right-hand side 
of each equation. We obtained qualitatively similar 
results with this procedure. 


Board of Governors of the Federal Reserve 
System, 1980 chart 1). By itself, of course, 
this correlation says nothing about any 
causality running from loans to inventories; 
it could just reflect fluctuations in inventory 
demand that are passively accommodated 
by bank loan supply. However, our results 
suggest that inventories could indeed be 
affected by the availability of bank credit. 
Thus financing costs may be important for 
inventories, but these costs may not be ade- 
quately captured by security-market interest 
rates alone. 

The results for investment are less uni- 
form. For equipment investment, the mix 
again appears to be very important, regard- 
less of the baseline model chosen. In con- 
trast, the mix appears to be much less im- 
portant for investment in nonresidential 
structures.*’ Nevertheless, in all cases, the 
impact of declines in intermediated financ- 
ing is aS expected: there is a negative short- 
run effect on investment, and this effect dies 
off at longer horizons. 

Table 2 also contains a second set of 
tests, in which the prime—CP spread is sub- 
stituted for the mix everywhere. As in the 
previous section of the paper, we find some 
evidence that the two variables contain simi- 
lar information. Like the mix, the prime—CP 
spread has significant explanatory power 
both for nondurable inventories and for 


7 This finding accords with the common result in 
the literature that it is notoriously difficult to find 
important financing effects for structures (see e.g., 
Robert J. Gordon and John M. Veitch, 1986). 
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NONDURABLES INVENTORIES 
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NONRETAIL DURABLES INVENTORIES 


INVESTMENT IN EQUIPMENT 


INVESTMENT IN STRUCTURES 


0.10 


FiGurE 3. IMpULSE RESPONSES AND APPROXIMATE 95-PERCE 
THE MIx IN NEOCLASSICAI 


0.06 


ENT CONFIDENCE INTERVALS FOR SHOCKS TO 
MobDELS 


Notes: Ticks along the x-axis mark quarters after the shock to the financing mix. Dotted lines show plus and minus 
two standard errors around the means, shown as solid lines. The means and confidence intervals were calculated 


from a Monte Carlo simulation using 500 replications. 


equipment investment. Unlike the mix, 
however, it adds little to the equations for 
durable inventories. Also unlike the mix, it 
is significant in two of the three structures 
specifications. 

While Table 2 demonstrates the statisti- 
cal significance of our loan-supply variables, 


it does not provide any guidance on their 
quantitative importance. We now attempt a 
rough calibration of these magnitudes. To 
begin, we compute the responses of invest- 
ment and inventories to shocks in the mix 
using the neoclassical specifications [equa- 
tions (16) and (20) augmented to include 
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lags of the mix]. We also compute confi- 
dence intervals using Monte Carlo simula- 
tions.2 The mean responses for a one-unit 
increase in the mix, along with standard- 
error bands are shown in Figure 3. As is 
typical, the range associated with plus and 
minus two standard errors around the path 
is fairly wide in all four cases, but at least in 
the short run, there appear to be some 
meaningful patterns in the data. 

Next, we calculate the impact of a one- 
time, 0.036 decline in the mix. This is a 
large decline, comparable to that seen in 
the two quarters after the Fed’s October 
1979 tightening. One year after such a shock, 
inventory investment would be reduced by 
about $36 billion (in 1982 dollars), equip- 
ment investment would be reduced by about 
$7.5 billion, and structures investment would 
be reduced by about $1.5 billion. Together 
this suggests a decline of roughly $45 bil- 
lion, which is just over 1 percent of GNP. 
While the standard errors on these esti- 
mates are large, the overall magnitudes seem 
reasonable, and the sectoral incidence is 
plausible: inventories are most strongly af- 
fected, while equipment investment is 
slightly affected, and investment in struc- 
tures is hardly affected. 


IV. Financing Variables as Predictors 
of Economic Activity 


In the previous section, we showed that 
financing variables such as our mix variable 
could help to predict certain components of 
economic activity, even after controlling for 
interest rates and other factors. These con- 
trols were necessary to address the specific 
hypothesis we were testing: whether condi- 
tion (ii) was satisfied and there was thus 
evidence of a distinct lending channel of 
monetary transmission. 

In this section, we shift our focus away 
from the issue of trying to differentiate 
carefully between the money and lending 
channels and address a less structured set 
of forecasting questions. In particular, we 


*8The results are based on 500 replications, assum- 
ing that the errors in each equation are normally 
distributed. 
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consider how our mix variable performs as a 
univariate “leading indicator” and contrast 
it with another variable that has received a 
great deal of attention in this regard: the 
spread between the rate on prime commer- 
cial paper and Treasury bills. 

Our results thus far suggest a couple of 
reasons why the mix might be a good uni- 
variate leading indicator. First, the evidence 
in Section III tells us that the mix has some 
forecasting power even when many other 
variables are included in the model. Second, 
the evidence in Section II implies that the 
mix might also do well in univariate fore- 
casting even if we had found no role for it 
in the multivariate setting. This is because 
even if condition (ii) fails to hold, and mon- 
etary policy thus works solely through a 
money channel, we know that the mix is 
correlated with changes in monetary policy. 

The insight that the mix may be a good 
proxy for the stance of monetary policy, no 
matter what the eventual channels of policy 
transmission, is also helpful in understand- 
ing the success of the CP-bills spread as a 
leading indicator. Several explanations for 
the spread’s success have been offered. The 
first is that the spread reflects expected 
future default risk and that this forward- 
looking property is what makes it a good 
predictor. Friedman and Kuttner (1993), as 
well as Bernanke (1990), argue that this is at 
best a partial explanation. For example, 
Bernanke points out that it is hard to recon- 
cile the 300-basis-point swings seen in the 
spread with changes in default expectations, 
given that defaults on prime commercial 
paper are extremely rare. Furthermore, both 
papers note that the CP-bills spread is not 
closely correlated with other more natural 
measures of default risk, such as the spread 
between BAA corporate bonds and Trea- 
sury bonds, or the spread between low-grade 
and high-grade commercial paper. 

A second possible explanation, put for- 
ward by Friedman and Kuttner (1993), fo- 
cuses on the changes in companies’ demand 
for funds that occur around cyclical turning 
points. To the extent that the end of an 
expansion is characterized by a buildup in 
inventories, there will be an increased de- 
mand for such short-term sources of funds 
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TABLE 3—CAUSALITY TESTS FOR MIX AND Six-MONTH CP-BILLs SPREAD 


1964-1989 
Spread Mix 


1964—1979:3 
Spread Mix 


1979:4-1989 
Spread Mix 


Dependent variable 


Capacity utilization 0.048 0.426 (0.110 0.007 
Log employment 10-8 0.003 10-° 0.063 0.005 0.078 
Log housing starts 0.015 0.105 0.059 0.660 0.229 0.054 
Log personal consumption 0.00002 0.314 0.001 0.127 0.049 0.460 
Log personal income 005 0.572 0.021 0.438 0.423 0.359 
Unemployment 0.017 0.266 0.012 0.006 
Industrial production 0.007 10° 0.221 0.017 0.004 
Log durable goods orders a 0.067 10-' 0.278 0.036 0.161 
Log retail sales 0.0002 0.645 0.003 0.635 0.049 0.529 


Notes: Entry in each cell is the significance level from a test that four lags of the mix 
or spread add explanatory power to a regression that already includes four lags of the 


dependent variable. All nonstationary variables have been differenced. 


as commercial paper. If commercial paper 
and Treasury bills are imperfect substitutes, 
this demand-side effect should widen the 
spread between the two. Friedman and 
Kuttner (1993) find empirical support for 
this proposition. 

A third explanation for the predictive 
prowess of the spread—discussed by both 
Friedman and Kuttner (1993) and Bernanke 
(1990)—is that the spread contains informa- 
tion about the stance of monetary policy. 
Our results in Section II support this hy- 
pothesis. Specifically, our results point to 
one clear reason why the CP-bills spread 
may be a good proxy for the stance of 
monetary policy: if tight money leads to an 
increase in commercial paper issuance, and 
if the commercial paper market is less than 
perfectly “deep,” then the end result will be 
an increase in paper rates relative to Trea- 
sury rates. In other words, the spread may 
be a good proxy for the stance of monetary 
policy for exactly the same reasons that the 
mix is. 

However, this line of reasoning raises the 
following question: should one expect the 
spread to continue to be a good forecaster 
in the future? Bernanke (1990) also pro- 
vides evidence on this point. He shows that 
the forecasting ability of the spread has 
deteriorated noticeably over the 1980’s. Ac- 
cording to the story above, this might be 
expected if the spread’s variation is driven 
by fluctuations in paper issuance. Over time, 
as markets become deeper and more liquid, 


this “price-pressure” effect may be reduced. 
Even if monetary policy continues to have a 
significant impact on paper volume, the im- 
pact on paper rates may diminish. This 
would weaken the forecasting power of the 
spread, while leaving the forecasting power 
of the mix unaffected.”? 

To investigate this hypothesis, we com- 
pare the forecasting properties of the 
CP -bills spread and our mix variable for a 
host of indicators of economic activity over 
several sample periods. Specifically, we fol- 
low Bernanke and Blinder (1992) and study 
the following activity measures: industrial 
production, capacity utilization, employ- 
ment, the unemployment rate, housing 
starts, personal income, retail sales, con- 
sumption, and durable-goods orders. We 
consider three sample periods: 1964-1989, 
1964—1979:3 and 1979:4-1989. The sample 
break point was suggested by Bernanke’s 
(1990) work on the changing forecasting 
ability of the CP-bills spread.” 

In each regression, we run the activity 
variable on four lags of itself and four lags 
of either the mix or the spread.*! These 
results are shown in Table 3. The results for 


’Bernanke (1990) describes other reasons why mar- 
ket developments might lead to a reduction in the 
forecé isting power of the spread. 

“By starting in 1964 we capture the 1966 episode 
without using too much data from the period before 
the, first major expansion in the paper market. 

‘tall nonstationary variables have been differenced 
in these regressions. 
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the CP-bills spread confirm Bernanke’s 
(1990) findings that the information content 
of the spread has fallen off noticeably in the 
1980’s; indeed, the spread’s explanatory 
power declines in the later subsample for 
every activity variable we study. In most 
cases, the spread still seems to have some 
predictive power, but overall the results are 
much less impressive. 

In contrast, the forecasting performance 
of the mix improves over the later subsam- 
ple for seven of the nine activity variables. 
Moreover, in the later sample period, the 
mix actually outperforms the spread for five 
of the nine activity measures in these fore- 
casting contests. This is in sharp contrast to 
the earlier sample period, where the spread 
uniformly dominated the mix. On balance, 
these results do suggest that the informa- 
tion content in the mix has held up better 
than that of the spread as the commercial 
paper market has deepened. Still, even over 
the 1980’s, it does not appear that the mix is 
an unambiguously better leading indicator 
than the spread. 


Conclusions 


There are two necessary conditions that 
must be satisfied in order for monetary pol- 
icy to affect the economy through a lending 
channel. First, banks must view loans and 
securities as imperfect substitutes on the 
asset side of their balance sheets, so that 
monetary tightening does indeed reduce the 
supply of bank loans. Second, loans and 
nonbank sources of finance must also be 
imperfect substitutes for firms on the liabil- 
ity side of their balance sheets, so that re- 
duced loan supply has real effects. The data 
we have examined suggest that both condi- 
tions are satisfied: shifts in monetary policy 
seem to alter the mix of loans and commer- 
cial paper, and the induced shifts in this mix 
seem to affect investment (even controlling 
for interest rates). 

In this paper, we have looked only at 
aggregate data. However, the lending view 
of monetary-policy transmission also has a 
number of important cross-sectional impli- 
cations. To take just one example, the lend- 
ing view suggests that the inventory and 
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investment declines that follow a monetary 
contraction should be disproportionately 
concentrated among ‘“bank-dependent” 
firms (i.e., firms with scarce cash reserves 
and without ready access to public bond 
and commercial paper markets). In current 
work (Kashyap et al., 1992) firm-level data 
are being used to investigate this hypothe- 
sis. 


Data APPENDIX 


The data used in the paper can be divided into four 
categories. Except where otherwise noted, all data come 
from the Federal Reserve data base. 

Financing Measures.—The bank loan and commer- 
cial paper series used in our investigation are taken 
from the Federal Reserve Board’s (1980) flow-of-funds 
publication. The bank loan series is the sum of two 
series. The first series is “Bank Loans, NEC,” line 25, 
in the “Nonfarm Non-Corporate Business” sector. The 
second series is “Bank Loans, NEC,” line 34, in the 
“Nonfinancial Corporate Business” sector, excluding 
farms. The commercial paper series is taken from line 
36 of the ‘“Nonfinancial Corporate Business” sector, 
excluding farms. Finally, the bond issuance series was 
constructed by splicing a Moody’s series and one 
tracked by the Federal Reserve. The nominal quantity 
data were adjusted using the 1982-based GNP deflator. 

Interest Rates.—We use a variety of interest rates in 
this study: the federal funds rate, the six-month Trea- 
sury bill rate, six-month high-quality commercial paper 
rate, the prime rate, and the 10-year constant-maturity 
government security. These are all standard series. 

Inventory and Investment Data.—The variables 
needed to run the inventory equations come from the 
MPS quarterly-model data base and the National In- 
come and Product Accounts. The MPS data are de- 
scribed at length in Flint Brayton and Eileen Mauskopf 
(1985) and are available upon request. The data on Q 
and investment were kindly provided by Glenn D. 
Rudebusch and are described in Stephen D. Oliner 
et al. (1992). 

Business-Cycle Indicators.—The data for the indica- 
tors of economic activity used in Section IV are all 
available from a number of sources. We used the 
Federal Reserve Board data base; Bernanke and Blin- 
der (1992) use the same data (obtained from Data 
Resources, Inc.) and provide the DRI mnemonics. 
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Strategic Discipline in Monetary Policy 
with Private Information: 
Optimal Targeting Horizons 


By MicHELLE R. GARFINKEL AND SEONGHWAN OH* 


This paper analyzes a multiperiod monetary targeting procedure as a possible 
resolution to the credibility problem in policy when the monetary authority has 
some private information. By limiting the degree of flexibility permitted in policy, 
this procedure mitigates the credibility problem. As the length of the targeting 
horizon decreases, the severity of the credibility problem falls, but at the expense 
of weakening the monetary authority’s ability to pursue its stabilization goals. 
Based on model simulations, the analysis studies the determinants of the optimal 
targeting horizon that balances the benefits of flexibility and discipline in policy. 


(JEL ESO, F61) 


How much flexibility or discretion should 
be given to the monetary authority? The 
answer to this time-honored question, in the 
monetarist spirit, is essentially none. Under 
the presumption that the monetary author- 
ity lacks an ability to forecast accurately or 
is able to forecast, at best, as well as eco- 
nomic agents, discretionary policy only cre- 
ates an additional element of uncertainty 
that unnecessarily complicates economic 
agents’ decision problems. Hence, the mon- 
etary authority should follow a strict rule, 


*Garfinkel: Department of Economics, University 
of California-Irvine, Irvine, CA 92717; Oh: Depart- 
ment of Economics, University of California-Los Ange- 
les, Los Angeles, CA 90024, and Department of Eco- 
nomics, Seoul National University, Seoul, 151-742, 
Korea. The first draft of this paper was completed 
while the first author was an economist and the second 
author was a visiting scholar at the Federal Reserve 
Bank of St. Louis. The views expressed in this paper 
are those of the authors and do not necessarily reflect 
the views of the Federal Reserve Bank of St. Louis or 
the Federal Reserve System. We thank, without impli- 
cating, Gary Hansen, Gregory Hess, Axel Leijon- 
hufvud, participants in the UCLA Money Workshop, 
and especially Guido Tabellini and two anonymous 
referees for their invaluable comments on earlier drafts 
of this paper. Scott Leitz provided useful technical 
assistance. 


for example, a constant-money-growth-rate 
rule.'| Against the monetarist prescription, 
proponents of an activist rule for the mone- 
tary authority argue that the inflexibility of 
a strict rule precludes an optimal response 
to unanticipated disturbances. 

To this already complicated question, re- 
cent developments in the literature have 
added more complexity, highlighting the 
strategic elements of monetary policy. When 
the market-determined output or employ- 
ment level is deemed suboptimal, a benevo- 
lent monetary authority might try to raise 
employment or output by surprising agents 
with high inflation if temporary nominal 
rigidities are present. Rational, forward- 
looking agents, recognizing this incentive, 
set high rates of wage inflation to discour- 
age the monetary authority from trying to 
reduce their real wage below their target 
level. Hence, even if the monetary authority 
is perfectly benevolent in that it maximizes 


‘See Allan H. Meltzer (1990), whose skepticism 
about the Fed’s forecasting ability leads him to advo- 
cate a constant-money-growth-rate rule. Axel Leijon- 
hufvud (1984) provides a detailed discussion about the 
uncertainty generated by discretionary policy under 
different monetary regimes. 
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social welfare, its policy might be inefficient; 
the equilibrium could be characterized by 
excessively high average inflation—an in- 
flationary bias. As recognized since Finn 
Kydland and Edward Prescott’s (1977) semi- 
nal paper, the efficient policy that avoids 
this bias might not be dynamically consis- 
tent or, more precisely, incentive-compati- 
ble. 

Generally, when there is complete infor- 
mation, in the sense that individuals know 
the monetary authority’s preferences and 
observe the realizations of the stochastic 
variables that constrain its choices, the rep- 
utational mechanism can serve to eliminate 
or, at least, to diminish the extent of the 
suboptimality arising from this dynamic con- 
sistency problem (see Robert J. Barro and 
David Gordon, 1983; Kenneth Rogoff, 
1987).* If, however, the monetary authority 
has some private information, the efficacy of 
the reputational mechanism is called into 
question. Specifically, as shown by Matthew 
B. Canzoneri (1985), when the information 
structure is incomplete so that individuals 
cannot verify that the monetary authori- 
ty has not intentionally invalidated their 
expectations, the efficacy of reputational 
considerations to resolve the credibility 
problem is weakened.? Moreover, private 
information precludes the effectiveness of 
any commitment technology to force the 
monetary authority to adhere to the optimal 
rule unless there is a separate mechanism to 


For a general discussion about dynamic inconsis- 
tency, see Brian Hillier and James Malcomson (1984). 
A necessary (but not sufficient) condition for dynamic 
inconsistency is that the government have fewer instru- 
ments than objectives. 

“Following Edward J. Green and Robert H. Porter 
(1984), however, Canzoneri (1985) shows how reputa- 
tional considerations can mitigate the credibility prob- 
lem partially, even in the presence of private informa- 
tion. The solution rests on a trigger strategy that does 
not require explicit verification of the monetary author- 
ity’s private information. However, this resolution is 
not complete. While the monetary authority adheres to 
the “reputational policy,” excessively high rates of in- 
flation would emerge on occasion. This solution, then, 
generates an expected rate of inflation in excess of the 
socially desired rate. 
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force the monetary authority to reveal its 
private information truthfully. 

Given these difficulties, Canzoneri (1985) 
suggests taking a J/egislative approach in or- 
der to resolve the credibility problem when 
the policymaker has private information.* 
This approach assumes an incomplete com- 
mitment technology that cannot force the 
monetary authority to reveal its private in- 
formation truthfully. Nevertheless, since a 
legislated procedure can be specified in 
terms of variables that are observed by all 
market participants (e.g., the growth rates 
of the monetary aggregates), this approach 
is potentially attractive. One example, stud- 
ied by Canzoneri, is a two-period targeting 
procedure requiring that the average money 
growth rate per two periods equal the so- 
cially desired rate. More generally, provided 
that the procedure does not depend on the 
monetary authority's private information 
and the monetary authority can control per- 
fectly the targeted variable, the procedure is 
operational. 

This paper extends Canzoneri’s (1985) 
analysis of a two-period targeting procedure 
in a multiperiod setting to investigate the 
scope of flexibility, in terms of the number 
of periods in the targeting horizon, that 
should be given to the monetary authority 
facing a trade-off between its output and 
inflation objectives. Legislation passed by 
Congress, for example, could mandate that 
the monetary authority announce a target- 
ing horizon, over which time the growth of 
the money stock must average the rate cor- 
responding to the (given) socially optimal 
inflation rate. Such an extension permits the 
analysis to identify the key factors that in- 
fluence the optimal choice of the targeting 
horizon which balances credibility and flex- 


*For complementary analyses of monetary policy 
with private information, see Alex Cukierman and 
Meltzer (1986) and Jeremy C. Stein (1989). In contrast 
to these analyses, which mainly focus on maintained 
secrecy in monetary policy arising from private infor- 
mation, the present paper tries to specify an opera- 
tional rule that improves upon the deadlock situation 
arising from the credibility problem. 
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ibility in monetary policy when there is pri- 
vate information.° 

Under this procedure, the longer the tar- 
geting horizon, the greater is the degree of 
flexibility permitted in policy. However, as 
the targeting horizon increases, the greater 
is the monetary authority’s willingness to act 
on its incentive to create surprise inflation 
in each period, and thus the greater is the 
(gross) inflationary bias in each period. In 
addition, the overall targeting constraint re- 
quires that these biases be reversed in sub- 
sequent periods. As the targeting horizon 
increases, then, the magnitude of those re- 
quired reversals also increases. Thus, as 
shown below, the contribution of the re- 
maining credibility problem to the variance 


Other existing analyses of institutional arrange- 
ments that can possibly mitigate the inefficiencies asso- 
ciated with the credibility problem (e.g., Rogoff, 1985; 
Susanne Lohmann, 1992) similarly emphasize the 
trade-off between credibility and flexibility. However, 
they abstract from the complications that can arise 
when the monetary authority has (nonverifiable) pri- 
vate information. Indeed, as argued by Canzoneri 
(1985), Rogoff's (1985) perverse policymaker solution 
to the credibility problem would not be effective when 
the monetary authority has private information con- 
cerning its own preferences as well as the realizations 
of the stochastic variables that constrain its choices. 
This criticism would apply as well to Lohmann (1992), 
who combines Rogoff’s (1985) perverse policymaker 
solution with Robert Flood and Peter Isard’s (1989) 
mixed-strategy solution. In addition, as will be dis- 
cussed, these analyses focus on flexibility within a pe- 
riod, rather than over a number of periods, resulting in 
a slightly different (but complementary) notion of flex- 
ibility from that developed here. In any case, because 
the institutional solutions (including the one analyzed 
in this paper) involve limits on policy, they are not 
complete resolutions to the credibility problem. Gener- 
ally, whether the institutional approach or the reputa- 
tional approach provides a more effective resolution 
would depend, in practice, on the monetary authority’s 
ability to influence expectations relative to its ability to 
design and establish institutions. Answering this ques- 
tion is difficult at best. Nevertheless, it should be 
feasible to derive the conditions under which any one 
of the institutional solutions produces a smaller dead- 
weight loss than that associated with Canzoneri’s (1985) 
reputational solution. Although the present analysis 
does not analytically derive those conditions, a numeri- 
cal example in which the reputational solution is clearly 
dominated by the targeting procedure studied here is 
presented (see footnote 16). 
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in inflation around the socially optimal rate 
increases as the targeting horizon increases. 
The optimal length of the targeting horizon 
embodies the optimal trade-off between the 
cost of the remaining credibility problem 
and the cost of limiting flexibility to accom- 
modate the predicted component of money 
demand shocks (i.e., the monetary author- 
ity’s private information) relative to the pol- 
icy that is fully discretionary. 

As the monetary authority attaches 
greater importance to its objective for infla- 
tion relative to its objective for output, the 
optimal targeting horizon becomes longer. 
The basic intuition for this result is that the 
monetary authority’s incentive to create sur- 
prises decreases, thereby decreasing the 
severity of the credibility problem and, 
hence, increasing the possible benefits of 
flexibility afforded by the N-period target- 
ing procedure to permit the monetary 
authority to react to its private informa- 
tion. For the extreme case in which the 
inflationary bias that emerges in the full- 
discretionary regime is so small that the 
constraint imposed by the targeting proce- 
dure becomes unnecessary, the optimal 
number of periods in the targeting horizon 
is infinite. For the other extreme case, in 
which the credibility problem is sufficiently 
severe to make the optimal targeting hori- 
zon equal to one period, the procedure sim- 
plifies to a constant-money-growth rule. 
Such a rule eliminates the equilibrium in- 
flationary bias associated with the credibility 
problem, but at the cost of removing all 
flexibility in policy. Nevertheless, as the 
weight that the monetary authority attaches 
to its inflation objective relative to its out- 
put objective approaches zero, the 
constant-money-growth rule is more likely 
to emerge as the optimal targeting policy. 

In what follows, the next section briefly 
presents a model of monetary policy with 
private information in a multiperiod setting 
and analyzes the monetary authority’s opti- 
mal, time-consistent monetary policy subject 
to the N-period average targeting con- 
straint. Section II presents simulations of 
the model to illustrate how the parameters 
of the model influence the optimal choice of 
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the targeting horizon. These simulations 
suggest that, for many parameter values, the 
cost of limiting flexibility to reduce the 
severity of the credibility problem will be 
too large to render a finite-period targeting 
procedure desirable; but, for many other 
parameter values, some discipline in policy 
will be warranted, even if the monetary 
authority’s forecasting ability is fairly accu- 
rate. Indeed, for a large part of the parame- 
ter space, the constant-money-growth rule 
emerges as the optimal policy. The present 
analysis suggests, then, that the credibility 
problem, as well as the monetary authority’s 
forecasting ability, plays an important role 
in the optimal design of feasible monetary 
rules. Finally, Section III offers some con- 
cluding remarks, including possible exten- 
sions of the analysis. 


I. A Model of Monetary Policy with 
Multiperiod Targeting 


This section presents a simple economic 
model to investigate the efficacy of multi- 
period average targeting procedures to sup- 
port a better outcome than that achieved 
with full discretion when the monetary au- 
thority has private information. Following 
conventional practice, the analysis builds on 
a standard rational-expectations supply 
function, 

(1) y,=y"+0(7, — 77) 6>0 

where y, denotes the logarithm of output in 
period t, y” denotes the logarithm of the 
natural level of output, 7, is the inflation 
rate in period ¢, and 7r/ is the wage setters’ 
expection of inflation conditional on infor- 
mation available at the end of period t —1.° 


°The emergence of the inflationary bias, discussed 
later in the paper, rests on the assumption embedded 
in this equation that workers supply labor along the 
demand schedule. However, as Cukierman (1992 Ch. 3) 
argues, it is highiy unlikely that employment is deter- 
mined along the labor demand curve when labor de- 
mand is in excess of supply. Cukierman demonstrates 
that, alternatively, when employment is viewed as be- 
ing determined by the “short side” of the market, the 
monetary authority’s incentive to create surprise infla- 
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The following money demand equation 
determines the equilibrium price level: 


n 


(2) Mm, 


where m, and p, equal, respectively, the 
logarithms of the money stock and the price 
level in period ¢ and v, is a money demand 
disturbance realized at the end of period f. 
The disturbance is assumed to follow a ran- 
dom walk. The equilibrium inflation rate is 
obtained by taking the first difference of (2), 


where 6,=vU,_,—U, and g, is the growth 
rate of money, the monetary authority’s in- 
strument;’ 5, is an independently and iden- 
tically distributed random variable with a 
zero mean and a finite variance, oe. When 
wages are set, 5, is not known (i.e., 5° = 0). 
The expression in (3), then, implies that 
wage setters’ expectation of inflation de- 
pends solely on the monetary authority’s 
policy, g,. This expectation, gf, ultimately 
depends on the monetary authority’s strat- 
egy. 


tion disappears and so does the inflationary bias. Nev- 
ertheless, some degree of unionization that keeps real 
wages above the real market-clearing wage can give 
the monetary authority an incentive to create surprise 
inflation to reduce (unexpectedly) the real wage, be- 
cause the union’s best strategy is to encourage its 
members to supply labor ex post along their individual 
labor supply curves. See Cukierman (1992) for the 
conditions under which the presence of unions gives 
rise to an inflationary bias. 

7As indicated earlier, for the targeting procedure to 
work, the analysis must assume that the monetary 
authority can control the growth rate of money per- 
fectly if that is its target. That the Federal Reserve 
System chooses to target the more broadly defined 
monetary aggregates (i.e., M2 and M3) rather than 
those over which it has more control (i.e., high-powered 
money and M1) could be related to the secrecy prob- 
lem. See, for example, Cukierman and Meltzer (1986) 
who show that the monetary authority might choose a 
less precise procedure for policy implementation so as 
to maintain some degree of ambiguity, whereby the 
secrecy of its information could be maintained to some 
extent. The present analysis, however, does not permit 
meaningful distinctions among the various monetary 
aggregates. 
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To study alternative strategies for the 
monetary authority, the analysis to follow 
assumes that the monetary authority chooses 
its policy to maximize its expected N-period 
average utility: 


N 


(4) U'= Dw 


t=1 


l<N<o 


where one-period utility in period f¢ is 
given by 


> 1. 


E{-} is an expectations operator condi- 
tional on period t =7 information, and N 
denotes the number of periods in the target- 
ing horizon.* For N =, the expected aver- 
age utility is given by E,{u’}. The logarithm 
of the socially desirable output level is rep- 
resented as ky", and z* is the socially 
determined optimal rate of inflation; s is 
the weight the monetary authority attaches 
to its goal of inflation stability relative to its 
goal of output stability.” 


5N can be thought of as the time horizon of an 
individual policymaker. However, given the stationary 
nature of the model, the solution would not change if, 
instead, the policymaker were infinitely-lived since the 
infinite-horizon problem can be decomposed into inde- 
pendent N-period-horizon problems. The implicit as- 
sumption here, that the monetary authority does not 
discount the future, is not strong. A discount factor 
less than 1 only complicates the analysis without pro- 
viding much additional insight. (A copy of the solution 
in the case of discounting is available from the authors 
upon request, and the effects of discounting on the 
optimal targeting horizon are discussed briefly at the 
end of Section II.) 

See Canzoneri (1985 pp. 1058-9) for a detailed 
discussion of the possible interpretations of the as- 
sumption that k >1, an assumption that is important 
for explaining the emergence of a positive inflationary 
bias. These interpretations build on distortions (pre- 
sumed to exist in the economy) that depress average 
output, or the natural level of output, below the level 
considered to be optimal from a social-welfare per- 
spective (but see footnote 6 and Cukierman [1992 Ch. 
3], who provides a useful critique of this social-welfare 
approach). As an alternative, Cukierman (1992) adopts 
a political approach to study central-bank behavior. 
This approach interprets the monetary authority’s out- 
put objective with k >1 as representing those interest 
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Letting U =U’/@? and u,=w',/@* and 
using (1) and (3) with the assumption that 
5°=0, the monetary authority's expected 
average N-period utility can be rewritten as 


N 
Lu, 


t=1 


1 
N 


(5) U=E, 
where 

e * 2 

9 ) 


f(g,-8,-7*) 


and f =(s/0*) and y* =(k —-1)y"/@. 
After period-t wages are set, the mone- 
tary authority chooses its policy, g,. In con- 
trast to wage-setters, the monetary authority 
receives some information about the distur- 
bance to money demand (6,) after wages 
are set but before policy actions are taken. 
Specifically, it has a private forecast of this 
disturbance, d, = E{6,}, that satisfies 


(6) 5,=d,+, 


where e, is an independently and identically 
distributed disturbance realized after policy 
is implemented. This forecast error has a 
zero mean, a finite variance, 02, and no 
correlation with d,. Similarly, d, is indepen- 
dently and identically distributed with a zero 
mean and a finite variance, 0,7. Thus, 0; = 
a; +a. Section II discusses the case when 
d, is serially correlated. Although wage- 
setters observe 5, and 7, after g, is set, 
they cannot distinguish the monetary au- 
thority’s forecast, d,, from the forecast er- 
ror, ¢,.'° 


groups advocating economic stimulation. Under this 
interpretation, the parameter s reflects “‘a compromise 
reached through the central bank between the advo- 
cates of economic stimulation and the advocates of 
price stability” (Cukierman, 1992 p. 43). This approach 
would seem inappropriate for the present representa- 
tive-agent model, however. 

‘Given the sequence of actions by wage-setters and 
the monetary authority relative to the timing of the 
realization of the monetary authority’s private informa- 
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Before the optimal, time-consistent mon- 
etary policy with multiperiod targeting is 
derived, two benchmark solutions are pre- 
sented. The first solution, the efficient solu- 
tion, is that which would obtain if there 
were a full commitment technology permit- 
ting the monetary authority to adhere to a 
contingent rule, while truthfully revealing 
its private information. The second solution, 
the full-discretionary solution, assumes that 
the monetary authority takes wage-setters’ 
expectations as given. It is identical to the 
solution that would emerge under an infi- 
nite targeting procedure, since N= is 
equivalent to no constraints on policy. 


A. The Efficient Solution 


Assuming a full commitment technology 
to force the monetary authority to reveal its 
private information truthfully and to adhere 
to a contingent rule so that it can influence 
wage-setters’ expectations effectively in a 
way that is consistent with its policy, the 
optimal monetary policy, g, is given by 


(7) &,=7* +d 


t 


for t=1,2,..., which yields the following 
expected average utility for the monetary 
authority: 


(8) O=-(1+f)o2-(y*)’. 


tion and the transitory nature of that private informa- 
tion, Vincent Crawford and Joel Sobel’s (1982) notion 
of cheap talk, recently applied to the monetary-policy 
problem by Stein (1989), is not operational in the 
context of this model. (See Oh and Garfinkel [1990] for 
a brief discussion of why the two-dimensionality of the 
credibility problem in monetary policy with private 
information obscures the relevance of the cheap-talk 
mechanism. To make cheap talk effective in this frame- 
work with a slight variation in the sequence of events, 
it is necessary to limit flexibility in monetary policy.) 
Section III discusses one extension of the model to 
incorporate the possibility of cheap talk in the multi- 
period targeting procedure. It should be noted here 
that the assumption that 6¢ = 0 is not crucial. Even if 
wage-setters had reasonable forecasts of 5,, the analy- 
sis to follow would be relevant, provided that the 
monetary authority’s forecast is private. 
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Note that the money growth rule in (7) 
completely accommodates money demand 
shocks to stabilize inflation around the so- 
cially desired rate. 


B. The Full-Discretionary Solution" 


Under an alternative assumption that the 
monetary authority behaves as if it can 
have no impact on wage-setters’ expecta- 
tions, it maximizes the expectation of (5) 
conditional on its forecast of the distur- 
bance to money demand, subject to (6) and 
taking g* as given. The wage-setters’ expec- 
tation of the associated first-order condition 
implies gf = 7* + y* /f. By substituting g* 
back into the first-order condition, one can 
verify that the money growth rate under this 
regime, g, is given by 


¥* 
(9) 


Note that the monetary authority’s pri- 
vate information is fully revealed in the 
outcome. Although the monetary authority 
can do no better than to set its policy ac- 
cording to (9) given wage-setters’ expecta- 
tions, the monetary authority’s incentive to 
create surprise inflation generates an infla- 
tionary bias, y*/f. Even if the monetary 
authority had no private information, this 
bias would emerge, provided that it took 
expectations as given. 

The inefficiency of the full-discretionary 
sclution is revealed by comparing the ex- 
pected average utility of the monetary au- 
thority in this regime, given by 


(10) U=-(1+f)o2- 


*)? 
FIO”? 


to that obtained in the efficient regime, 
given by (8). The difference between (8) and 


"In the static framework, this solution is equivalent 
to the (one-shot) Nash solution. As already noted and 
as discussed in part C of this section, however, this 
solution more generally is interpreted as an infinite- 
period, average targeting procedure. 


3 
| 
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(10), (y*)?/f, captures the disutility of the 
inflationary bias. 


C. The N-Period Targeting Solution 


As is widely recognized, while the full- 
discretionary policy is not first-best, the ef- 
ficient solution is not necessarily a feasible 
outcome in the absence of a full commit- 
ment technology. That is, the monetary pol- 
icy in (7) is not incentive-compatible. Once 
wages are set, the monetary authority’s opti- 
mal policy no longer corresponds to (7). 
Rather, the monetary authority would like 
to set g,=7* +d,+ y*/(+f). In addi- 
tion, the monetary authority’s private infor- 
mation obscures the role of reputational 
considerations to diminish the inflationary 
bias. Specifically, without a complete infor- 
mation structure, wage-setters cannot verify 
that the monetary authority has followed 
the efficient policy.'* Moreover, the mone- 
tary authority’s private information pre- 
cludes the effectiveness of legislation requir- 
ing the monetary authority to implement 
policy according to (7). Even if the legisla- 
tion were binding, the monetary authority 
would have an incentive to lie, claiming that 
its forecast of 6, equaled d,+ y* /(+ f), 
whereby it could disguise the optimal cheat- 
ing policy (given g*=7*) as the efficient 
one. Without any additional constraints on 
policy, the presence of nonverifiable private 
information means that the monetary au- 
thority cannot influence wage-setters’ ex- 
pectations effectively, and the full-discre- 
tionary policy emerges in the equilibrium 
outcome. 

As a possible resolution to the ineffi- 
ciency of the inflationary bias when the 
monetary authority cannot credibly reveal 
its private information, Canzoneri (1985) 
studies a two-period average targeting pro- 
cedure, requiring g,+ g,=27*. This pro- 
cedure is attractive not only in its simplicity, 
but in its independence from the monetary 
authority’s private forecast. Such indepen- 
dence makes it operational, for enforce- 
ment of the (precommitted) constraint does 


'2 But see footnote 3. 
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not require verification of d,.'° As a gener- 
alization of Canzoneri’s analysis, consider 
an extended average targeting procedure 
that requires 


(11) 


where 1<N<o. For N=, there is no 
relevant constraint on the conduct of policy. 

To derive the optimal, dynamically con- 
sistent monetary policy under this (precom- 
mitted) targeting regime, a backward solu- 
tion concept is appropriate. Specifically, in 
period ¢=N-—1, the monetary authority 
maximizes the sum of the expected value of 
(5) in t=N and t= N-—1 with respect to 
gy, Subject to the forecast d,_, and (11). 
In solving this optimization problem, the 
monetary authority takes gx, and gy,_, as 
given while recognizing the impact that g,/_, 
will have on future expectations, gy, 
through the constraint in (11). Repeating 
this exercise for t= N-—2, N-—3, ..., one 
can find that 


(N—t)(1+ f) 
N-t+(N-t+l1)f 


N, where g denotes the 


‘To be sure, the analysis to follow implicitly as- 
sumes that a precommitment to adhere to the targeting 
constraint in (11) is credible. Even if the monetary 
authority's expected average utility under this proce- 
dure (for a given N) exceeds that which would be 
obtained under the full-discretionary solution (N =~), 
the constraint need not be incentive-compatible in any 
period during the targeting horizon. In other words, in 
any period t < N <, given expectations, the monetary 
authority has an incentive to violate the constraint. 
This incentive undermines the credibility of the pre- 
commitment. The present analysis implicitly assumes, 
then, that there exists an external enforcement mecha- 
nism (e.g., legislation effectively requiring that the 
monetary authority be severely punished if it fails to hit 
that average period target over the prespecified hori- 
zon, N) to support the procedure provided that it 
yields a better outcome than that obtained under full 
discretion, a condition that is examined in Section II. 
Note that, when this condition is satisfied, reputational 
considerations might be able to substitute for the as- 
sumed commitment technology to support the targeting 
rule in equilibrium. 


N 
8,=7*N 
t=] 
2) 
fos °° | 
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money growth policy under the N-period 
targeting procedure and 


(N-t)y* 
= 
N=-t+1  (N-t+1)f 
é, 
N-t+1 


for t=1,2,...,N. Note that, in the expres- 
sion above, the summation is taken only for 
t>1. (In what follows, Li is taken to be 
equal to zero whenever s>t.) The Ap- 
pendix illustrates the solution method used 
here more explicitly. 

To eliminate past g so as to express the 
optimal policy only in terms of current and 
past d as well as the parameters of the 
model, (12) is used to find g, and sequen- 
tially used to find g,, f = 2,3,...,.N, by sub- 
stituting in past g,. That is, 


13 
(13) &,=T(t) 
where 
Na* (N-—t)y* 
(N-t)(1+ f) 


N-t+(N-t+l1)f ‘ 


for t=1,...,N. By rearranging (13) and 

simplifying, one can verify that the optimal 

monetary policy subject to the N-period tar- 

geting constraint is given by the following: 
(N-t)y* 


OF 


(N-t)(1+ f) 
N-t+(N-t+l1)f 


* 
¢-1 y 


(N—-7+)f 
1 

+ d. 


for t= 1,2)... 
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Expected average utility under this regime 
is given by 


(15) U=-(y*y-(+ 


fratf) 1 


Note that the sum of the first two terms in 
the above expression equals the expected 
average utility that would be realized if the 
efficient outcome could be supported in 
equilibrium, (8). Hence, the last three terms 
reflect the inefficiency of the targeting pro- 
cedure.4 


Sor N=1, equation (15) reduces to the average 
expected utility under a constant-money-growth rule, 
given by —(y*)*—(1+f)o?. In this case, the last 
three terms reduce to (1+ flo, the cost arising from 
removing all flexibility in policy. As N approaches 
infinity, (15) reduces to (10), the average expected 
utility under the full-discretionary regime. To verify 
this, note that the third term approaches (y*)?/f as N 
approaches infinity, while the last two terms converge 
to zero. The convergence of the last term to zero can 
be verified easily by observing that the double summa- 
tion of that term can be written as 


2 


1 
N, 


Since the limit of (t—1)/(t—1+ tf)? converges to 
zero as t approaches infinity, the arithmetic mean of 
such a sequence will also be zero (see Walter Rudin, 
1976 p. 80). Thus, when N =~, the last three terms in 
(15) boil down to (y*)?/f, the disutility of the inflation 
that would emerge in the full-discretionary solution. 
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FiGurE 1. NORMALIZED INEFFICIENCY OF MULTIPERIOD TARGETING 
PROCEDURES 


The optimal targeting horizon, N*, maxi- 
mizes (15). Assuming that N* is finite,'> the 
average inefficiency is increasing in N for 
N> N*, as shown with an example in Fig- 
ure 1. The horizontal line in the figure 
measures the normalized inefficiency of the 
full-discretionary solution when f = 1.0 and 
y* /o,=1.35 [ie., (y*)?/foj = 1.82]. The 
inverted V-shaped curve measures the nor- 
malized, average inefficiency of a multi- 
period targeting over various N, showing 
that N* =6 for the chosen parameter val- 
ues.'© 

Given the N-period targeting constraint, 
the monetary policy in (14) is dynamically 


'Saithough writing down a closed-form solution for 
N* is extremely difficult, it is possible to verify that N* 
can approach infinity for a certain set of parameter 
specifications (see Section ID). 

It should be noted that, for a; < 1.5, Canzoneri’s 
(1985) reputational solution would not be feasible since 
it requires that y* /f > 2 (p. 1065). For this numerical 
example, then, the reputational solution is clearly dom- 
inated by the institutional solution studied in this pa- 
per, when a, < 1.5. 


consistent, like the full-discretionary solu- 
tion without any constraints. Note, however, 
that for finite N the gross inflationary bias 
li.e., the second term in (14)] is lower than 
that in the full-discretionary solution. From 
(13) or (14) in t =1, for example, one can 
verify that the bias is (N —1)y* / Nf, which 
is reversed in each of the subsequent N —1 
periods, in increments of y*/Nf. The in- 
crements are of the same magnitude since 
the monetary authority does not discount 
the future. Similarly, the bias in t=2 is 
(N —2)y* /(N—Wf, which is reversed in 
the subsequent N —2 periods in increments 
of y*/(N-\A)f. As t approaches N, the 
monetary authority’s willingness to act on 
its incentive to create surprise inflation falls, 
and so does the gross inflationary bias. At 
the same time, there is an increasing accu- 
mulation of past biases to be reversed. Thus, 
as t approaches N, the period-t inflationary 
bias net of the sum of current reversals of 
previous periods’ biases (referred to here as 
the remaining or net inflationary bias) falls 
and eventually becomes negative to make 
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TABLE 1—THE NET INFLATIONARY BIASES OVER N 


t 


N 1 2 3 4 5 6 7 8 9 10 h 
1 0.00 0.00 
2 039 -0.39 0.15 
3 0.52 0.13 —0.65 0.24 
0.59 -007 -08s 0.29 
0.62 0.43 0.17 -0.22  -1.00 0.33 
6 0.65 0.49 0.30 om . 0.36 
8 0.68 0.57 0.44 0.29 0.09 -017 -056 -1.34 0.40 
9 0.69 0.60 0.48 0.35 0.20 0.00 -026 -065 —-1.43 0.42 

10 0.70 0.62 0.52 0.41 0.28 


Notes: In this example, f =1, y* = 0.78, and 7* = 0.02. The inflationary bias when N =~ equals 0.78. For this 
example N* = 6. For the last column, b denotes the disutility from the remaining inflationary bias [i.e., the absolute 
value of the third term in equation (15)]. Note that the rows might not add to zero due to rounding. 


the average remaining inflationary bias over 
the targeting horizon zero. 

The movement in the remaining infla- 
tionary bias (i.e., the unconditional expecta- 
tion of money growth in excess of the so- 
cially determined optimal inflation rate) over 
t is given in Table 1 for 1< N<10. The 
table is constructed on the basis of the 
previous example with the additional as- 
sumptions that m*=0.02 and y* =0.78 
(implying that = 1/3). For this example, 
the inflationary bias that would emerge in 
the full-discretionary regime is 0.78 for all f. 
As revealed by the table, although the re- 
maining inflationary bias averages zero over 
the horizon, this remaining bias in absolute 
terms can exceed the bias that would emerge 
in the full-discretionary solution. In addi- 
tion, the table indicates that increases in N 
imply a greater average of the absolute value 
of the remaining inflationary biases that 
emerge over the targeting horizon due to 
the constraint (11). The basic idea here is 
similar to that behind the downward move- 
ment of the remaining inflationary bias over 
the horizon. As N increases, the targeting 
constraint (11) appears less binding to the 
monetary authority at any given period ¢. 
Accordingly, the monetary authority is more 
inclined to act on its incentive to create 
surprise inflation, which is recognized by 
wage-setters. Of course, the future reversals 
must be greater, in magnitude, if the target- 
ing constraint (11) is to be satisfied. Thus, as 


N increases, the variance of the remaining 
bias around zero increases. However, be- 
cause the remaining biases are perfectly 
anticipated by wage-setters, the increased 
variance of the remaining bias has no im- 
plications for the variation of output. 

The disutility of the remaining inflation- 
ary biases is captured by the third term in 
(15). For N =1, this term equals zero. Be- 
cause any period’s (gross) inflationary bias 
(which must eventually be reversed over the 
remainder of the targeting horizon) in- 
creases as N increases, the loss in utility 
due to the effects of the remaining infla- 
tionary bias on the variability of inflation 
around 7* increases as N becomes large. 
As N goes to infinity, this loss converges to 
(y*)?/f; in the example above, this loss, 
reported in the last column of Table 1, 
converges to 0.608. 

In contrast to the full-discretionary and 
the efficient solutions, the monetary policy 
in (14) involves only a partial accommoda- 
tion of the current shock. The targeting 
procedure limits the monetary authority’s 
flexibility to react to its forecast. However, 
as N approaches infinity, the accommoda- 
tion is full. The effect of increasing the 
monetary authority’s flexibility to react to its 
private information is partly captured by the 
fourth term in (15), which equals (1+ fo? 
for N = 1 and is decreasing in N, approach- 
ing zero as N approaches infinity. That is, 
by relaxing the monetary authority’s con- 
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straint, increasing the length of the target- 
ing horizon provides more leeway for the 
monetary authority to achieve its output 
and inflation-stabilization goals. 

The constraint in (11), however, requires 
that the accommodation in every period be 
reversed in subsequent periods. The effect 
of this requirement can be observed easily 
in (13) or (14). In particular, as the con- 
straint becomes more binding (i.e., ¢ ap- 
proaches N), the monetary authority’s reac- 
tion to the current shock approaches zero. 
The reaction to the shock in time f¢,d,, is 
reversed in increments of 


L+f 


d 


' 


in each of the remaining N-—tf periods. 
Again, that the increments of the reversal of 
a given reaction to a time-t shock are of 
equal magnitude is driven by the assump- 
tion that the monetary authority does not 
discount the future. Note that these rever- 
sals have no implications on the variability 
of output. In this model, the optimal policy, 
given by (12), implies that past money dis- 
turbances, d,, are fully revealed to wage- 
setters upon their observing g,. Thus, 
wage-setters fully incorporate these rever- 
sals into expectations. However, the rever- 
sals do influence the variability of inflation 
around 7*. 

The fifth term in (15) reflects the cost of 
the increased variability of inflation due to 
the requirement of having to reverse earlier 
reactions to shocks. As N increases from 1, 
that loss in expected utility initially in- 
creases from zero. That loss, however, de- 
creases as N becomes sufficiently large. At 
this point, increasing flexibility reduces the 
expected cost of engineering those rever- 
sals, so that the loss captured by the fifth 
term falls and finally converges to zero. 
Combining the fourth and fifth terms of (15) 
yields the net effect of limiting the scope of 
flexibility as permitted by the targeting pro- 
cedure. 

That the sum of the last terms in (15) is 
increasing in magnitude in N for N>N* 
implies that the targeting procedure be- 
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comes more inefficient as N increases above 
the optimal targeting horizon. Nevertheless, 
note that a sufficient condition for N* to be 
finite is that (1+ <(y*)?/f that 
a strict one-period targeting rule dominates 
the full-discretionary solution) as can be 
verified by comparing the expected average 
utility under the targeting procedure, given 
by (15) with N = 1, to the expected average 
utility in the full-discretionary regime, given 
by (10) or by (15) with N=. This condi- 
tion is not necessary, however. More gener- 
ally, as revealed by a comparison of (10) and 
(15), the targeting procedure will dominate 
the full-discretionary solution (N =) pro- 
vided that the magnitude of the sum of 
those terms, for any finite N, is less than 
(y*)°/f, the disutility of the inflation- 
ary bias that would emerge in the full- 
discretionary equilibrium. 

The basic idea here is as follows: as N 
goes from | to infinity, the variance of out- 
put falls, approaching that which would ob- 
tain in the full-discretionary regime. At the 
same time, the variance of inflation around 
a* due to the remaining credibility problem 
increases, approaching that which would 
obtain in the full-discretionary regime. Al- 
though increasing N also implies more flex- 
ibility to insulate inflation and output from 
money demand shocks, the required rever- 
sals of reactions to past d, adds to the 
variability of inflation around 7*, at least 
initially as N increases from 1. The variabil- 
ity due to such reversals over the targeting 
horizon falls sometime thereafter, how- 
ever, approaching zero as N goes to infinity. 
Thus, even if the strict one-period targeting 
procedure does not dominate the full- 
discretionary policy, some limits on flexibil- 
ity might be desirable. For N* to be finite 
under conditions for which the full- 
discretionary policy dominates the constant- 
money-growth rule, the disutility of the in- 
creased variability of inflation around 7* 
due to the reversals of earlier reactions to 
d, must fall soon after N increases from 1 
and approach zero at a rapid rate as N 
continues to increase. Otherwise, the reduc- 
tion in the variability of inflation around the 
socially determined rate below that which 
obtains in the full-discretionary regime will 
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TABLE 2—OPTIMAL TARGETING Horizons, N* 


f 
y*/oq 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1.0 
0.4 9 00 00 00 00 00 00 00 oo 
0.5 1 3 13 re) 00 
0.6 1 2 4 16 00 oo oo oo 
0.7 1 1 5 19 00 00 
0.8 1 1 1 3 6 22 90 oo oo 00 
1.0 1 1 1 1 Z 3 i 28 oo oo 
1.1 1 1 1 1 i} 2 3 8 30 oo 
1.2 j1 1 1 1 1 1 2 4 8 32 
1.3 1 1 1 1 1 1 1 2 4 9 
1.4 1 1 1 1 1 1 1 2 2 4 
i 1 1 1 1 1 1 1 1 2 Z 
1.6 1 1 1 1 1 1 1 1 1 2 
7 1 1 1 1 1 1 1 1 1 1 


Note: Strictly speaking, N* =~ indicates that the full-discretionary solution dominates 
only those for finite multiperiod targeting procedures with N < 200. 


not be sufficient to compensate for the in- 
creased variability in output induced by the 
targeting constraint; in this case, no con- 
straints on policy through the targeting pro- 
cedure would be desirable. 


II. Optimal Targeting Horizons: 
Simulations of the Model 


The optimal degree of flexibility depends 
on the accuracy of the monetary authority’s 
forecasts of money demand disturbances, as 
measured by a7 for a given a? or a given 
o,, and the severity of the credibility prob- 
lem, as measured by y*/f. As in much of 
this literature, however, analytically deriving 
the optimal targeting horizon, N*, that 
maximizes expected average utility, as given 
by (15), is not possible despite the simplicity 
of the model. This section summarizes the 
results of simulations that specify the two 
key parameters of the model: y* /o, (the 
weighted difference between the player’s 
output goals relative to the magnitude of 
the predictable part of the shock) and f 
(the ratio of the weight attached to inflation 
stability relative to output stability in the 
monetary authority’s preferences, s, to the 


squared elasticity of output with respect to 
unanticipated inflation, @). For various pa- 
rameter specifications, N* is reported in 
Table 2. Discrete jumps in N*, as the pa- 
rameter values vary. are due to the fact that 
changes in the parameter values are not 
sufficiently small. 

A close inspection of the table reveals 
two important implications of the simula- 
tion exercise. First, for a given y* /a,, as f 
increases, the optimal targeting horizon gets 
longer. That is, as the weight of importance 
that the monetary authority attaches to its 
inflation objective relative to that for its 
output objective increases, the optimal 
length of the targeting horizon increases.!” 


'7Note that an analogous comparative-static exer- 
cise involving a hypothetical decrease in the elasticity 
of output with respect to unanticipated inflation, 6, 
would not necessarily produce the same result. A 
smaller @ implies not only a larger f, but a larger y*. 
As we will discuss, a larger y*, for a given f, implies a 
smaller N*. Thus, hypothetical changes in f should be 
interpreted as changes in s only. Similarly, hypotheti- 
cal changes in y* should be interpreted as changes in 
either k or y" only. 
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The basic idea behind this implication is 
rather simple. As f increases, the monetary 
authority’s incentive to push output beyond 
its natural level falls so that the inflation- 
ary bias that would emerge in the full- 
discretionary equilibrium falls. Accordingly, 
the optimal scope of flexibility under the 
targeting procedure increases. For large val- 
ues of f, the optimal targeting solution boils 
down to the full-discretionary solution, in 
which there is complete flexibility. In this 
case, the inflationary bias that emerges un- 
der full discretion becomes so trivial that 
the limits on flexibility imposed by the tar- 
geting procedure are unnecessary. 

This implication is similar to that of Ro- 
goff's (1985) analysis. In contrast to the 
present analysis, however, Rogoff distin- 
guishes the monetary authority’s prefer- 
ences from that of society and thereby gives 
an institutional interpretation to the mone- 
tary authority’s preferences.'* The differ- 
ence emerges in the weight of importance 
attached to the inflation goal relative to the 
output goal; for convenience, the difference 
is denoted here by n= f,, — f > 0, where f 
reflects society’s preferences and f,, reflects 
the (perverse) monetary authority’s prefer- 
ences. Nevertheless, as in Rogoff’s analysis, 
the inflationary bias that emerges in the 
full-discretionary outcome becomes less se- 
vere, as the monetary authority’s dislike for 
deviations of actual inflation away from the 
socially determined optimal rate increases. 
The important distinction between the 
present analysis and that of Rogoff lies in 
his interpretation of flexibility. Specifically, 
in a one-shot-game setting, he roughly inter- 
prets increasing the monetary authority’s 
relative concern for inflation stability around 
m* (i.e., increasing f,, for a given f) as 
limiting flexibility, for it implies less com- 
plete and, for a given f, less efficient ac- 
commodations of supply shocks by the mon- 
etary authority.'!? In contrast to Rogoff 


'8also see Lohmann (1992). For a general survey on 
this and related issues, see Torsten Persson and Guido 
Tabellini (1990). 

‘Under this interpretation, then, the fully flexible 
policy is that which would otherwise be implemented 
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(1985), the present multiperiod analysis in- 
terprets increases in flexibility in terms of 
increasing N, finding that the more perverse 
is the monetary authority (i.e., the larger 
is f) the greater are the advantages of full 
flexibility in monetary policy. 

The second implication of the simulation 
exercise is that, for a given f, the optimal 
targeting horizon becomes shorter as y* /a, 
increases. There are two potential forces at 
work here. As y* increases for a given a, 
the difference between the socially optimal 
output target and the natural level of out- 
put becomes larger; as a consequence, the 
inflationary bias that would emerge in the 
full-discretionary solution becomes larger, 
thereby detracting from the benefits of flex- 
ibility afforded by the targeting procedure. 
Furthermore, as a, falls for a given y*, the 
expected value of reacting to current shocks 
(the monetary authority’s private informa- 
tion) to stabilize inflation and output falls. 
Since a7 = 07-02, a decrease in o7 can 
be thought of as a decrease in of or as an 
increase in o,”. In either case, the reduction 
in a; implies a reduction in the degree of 
accuracy of the monetary authority’s fore- 
cast, oj /o?. Not surprisingly, then, the 
simulation exercise suggests that, with a 


by the policymaker with preferences corresponding ex- 
actly to those of society. In her interesting analysis of 
what incentive structure should be imposed on the 
monetary authority, Lohmann (1992) adds another di- 
mension to this interpretation of flexibility with a 
notion of central-bank independence—in particular, 
independence of the monetary authority from the poli- 
cymaker having preferences identical to those of soci- 
ety. Building on Flood and Isard (1989), her notion of 
independence revolves around the possibility that the 
policymaker can, at a cost, dismiss the monetary au- 
thority and implement the full-discretionary policy. In 
equilibrium, the monetary authority follows a policy 
involving a flexible escape clause, whereby it can avoid 
dismissal. The smaller the cost of dismissal, the smaller 
is the degree of independence and, for a given f,,, the 
closer is the monetary authority’s reaction to the shock 
to what the policymaker’s reaction would be if it were 
to formulate and otherwise implement policy directly. 
Thus, for a given f,,, a decrease in independence could 
be interpreted as an increase in flexibility. (Note that 
Rogof?’s [1985] analysis implicitly assumes that the cost 
of dismissal is infinite, implying no independence.) 


reduction in the monetary authority’s fore- 
casting ability, the optimal limits on flexi- 
bility are greater (i.e., N* is smaller). The 
strict one-period targeting rule is more likely 
to be optimal as y*/a,, rises. 

This implication further distinguishes the 
results of the present analysis from those of 
Rogoff (1985). Specifically, Rogoff finds that 
some flexibility in monetary policy is invari- 
ably better than none (i.e., a constant- 
money-growth rule or, in terms of their 
analyses, an extremely perverse monetary 
authority with » =~).”° In contrast, the pre- 
sent analysis finds that a constant-growth 
rule (N* =1) can emerge as the optimal 
policy for a wide range of parameter values. 
There are two reasons for this distinction. 
First, the present analysis explicitly consid- 
ers the monetary authority’s forecasting 
ability or lack thereof. Reducing the mone- 
tary authority’s ability to predict money de- 
mand shocks, which should be accommo- 
dated to stabilize inflation and output, 
weakens the case for maintained flexibility 
in policy. Second, the present analysis al- 
lows for the possibility of extending the 
one-period game setting to a multiperiod 
game setting where, through the targeting 
constraint, the monetary authority can par- 
tially influence future expectations through 
its current policy. Even if there were supply 
shocks, as in Rogoff’s model, this distinction 
would emerge. Nevertheless, for those pa- 
rameter values under which the present 
analysis finds that N* =, there could be 
room for improvement by limiting flexibility 
through the alternative institutional ar- 
rangements studied by Rogoff (1985) and 
Lohmann (1992): specifically, through the 
appointment of a perverse central banker 


Similarly, Lohmann (1992) finds that complete in- 


dependence is not optimal (see footnote 19). It should 
be noted, however, that both Rogoff (1985) and 
Lohmann (1992) also find that some limits on flexibility 
are always better than none (i.e., 1 =0 is never opti- 
mal). (Lohmann also finds that some independence is 
always better than none.) This distinction emerges 
since, in their analyses, limited flexibility (4 > 0) does 
not constrain reactions to money demand disturbances. 
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with some degree of independence. More 
generally, as noted in Section III, it might 
be possible to combine these institutional 
arrangements with the multiperiod targeting 
procedure to effect a more efficient balance 
between credibility and flexibility. 

It is difficult to interpret the simulation 
results directly in terms of actual data be- 
cause the difference in output goals is mea- 
sured in (log) levels, and the scale that 
would be appropriate is not clear. Interpret- 
ing the simulation results is more difficult 
without knowledge of the value of the pa- 
rameter f. Nevertheless, Table 2 indicates 
that the full-discretionary solution domi- 
nates any finite targeting procedure for a 
large part of the parameter space.”! How- 
ever, for those parameter values in which 
there is room for improvement by imposing 
a binding constraint on monetary policy, the 
table illustrates that, often, N* =1. This 
result is rather interesting, considering the 
fact that, in many nations where central 
banks have adopted some form of monetary 
targeting procedure, the targeting horizon 
coincides with the wage contract period, 
typically one year. 

The result is also illustrated by Figure 2, 
which depicts regions for N* = 1, 2, 3, and 4 
and for N* > 5 in terms of combinations of 
f and y*/o,. Even with this limited oppor- 
tunity set, the figure shows that the regions 
for which N* = 1 and N* > S are the largest 
areas among those under consideration. 
More generally, by sequentially increasing 
the opportunity set and deriving explicit re- 
gions for N* > 5 (e.g., N* =5, N* =6, etc.), 
it is possible to see that regions for N* 
become progressively narrower as N* <o 
increases. Hence, assuming that N* is fi- 
nite, the optimal degree of flexibility is likely 
to be extremely limited if not entirely elimi- 
nated. Of course, this claim implicitly as- 
sumes a uniform distribution for the actual 


*IAdditional simulations confirm this interpretation 
of the table. Moreover, they show that, for f > y* /a,, 
the full-discretionary solution dominates all finite tar- 
geting procedures with N < 1,000. 
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values of the parameters. Without knowl- 
edge of the empirical distribution of the key 
parameters, the figure shows only the condi- 
tions (in terms of combinations of parame- 
ter values) under which certain regimes 
would be predicted to emerge by the pres- 
ent analysis. In any case, the simulation 
exercise provides additional support to the 
notion that it is important to take into ac- 
count the credibility problem, in addition to 
the monetary authority’s forecasting ability, 
when designing feasible optimal rules for 
monetary policy. 

One important caveat to the above analy- 
sis should be noted. In particular, the as- 
sumption that d, is not serially correlated is 
a strong assumption. Indeed, the extreme 
results above (i.e., given N* is finite, ex- 
tremely limited flexibility is often desirable) 
might be driven partly by the transitory na- 
ture of the monetary authority’s private in- 
formation. While the effect of persistence of 
shocks to money demand on the optimal 
targeting horizon is not obvious in the con- 
text of this model, the analysis by Cukier- 
man and Meltzer (1986) suggests that the 
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introduction of such persistence is likely to 
increase the monetary authority’s desire for 
maintained flexibility. In particular, their 
analysis shows how persistence in the mone- 
tary authority’s private information, which is 
never directly observed by the public, pro- 
vides a channel through which the monetary 
authority can influence (indirectly and im- 
perfectly) future expectations, without im- 
posing any explicit limits on flexibility. Per- 
sistence, in the context of this model, then 
might imply a smaller role for a restrictive 
targeting procedure as an imperfect mecha- 
nism by which the monetary authority can 
influence future expectations indirectly. 
However, the qualitative results obtained 
above are not sensitive to the assumption 
that the monetary authority does not dis- 
count the future. That is, N* is increasing 
in f and decreasing in y*/o,, even when 
the monetary authority discounts the future. 
In the case of discounting, under a given 
N-period targeting constraint, the monetary 
authority’s willingness to act on its incentive 
to create surprise inflation would be greater, 
and its optimal policy would involve more 
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complete accommodations of the predicted 
part of the money demand disturbances, 
since future reversals are perceived as being 
less of a burden. As such, when the mone- 
tary authority discounts the future, a given 
N-period targeting solution retains more 
flexibility but does not gain as much credi- 
bility over the full-discretionary solution, 
relative to when the monetary authority does 
not discount the future. An examination of 
the utilities obtained under various N- 
period targeting constraints when the mone- 
tary authority discounts the future suggests 
that, as the discount rate increases, the op- 
timal targeting horizon, N*, falls for rela- 
tively higher values of y*/o, and smaller 
values of f; but, for relatively smaller values 
of y*/o, and larger values of f, an in- 
crease in the discount rate will imply an 
increase in N*. 


III. Concluding Remarks 


This paper has investigated the efficacy of 
average monetary targeting to reduce the 
inflationary bias that otherwise emerges as a 
result of the monetary authority’s incentive 
to surprise agents in an effort to increase 
output beyond its natural level. Reputa- 
tional considerations and the imposition of 
a contingent rule, as reasonable methods of 
eliminating the bias while retaining flexibil- 
ity for output and inflation stabilization, are 
called into question when the monetary au- 
thority has some private information and 
there is no mechanism to enforce a truthful 
dissemination of that information. Provided 
that the inflationary bias is sufficiently large, 
an average monetary targeting procedure to 
eliminate the bias will be appealing. As the 
targeting horizon gets longer, the amount of 
flexibility permitted by the procedure in- 
creases, but at the cost of a higher variance 
of the unconditionally expected inflation 
rate around its socially determined optimal 
rate. Hence, the optimal targeting horizon 
defines the optimal trade-off between the 
cost of limited flexibility and the cost of the 
remaining inefficiencies of the credibility 
problem. 

As indicated earlier, that the optimal tar- 
geting horizon is often very short might be 
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accounted for, in part, by the transitory 
nature of the monetary authority’s private 
information. Thus an interesting extension 
of the analysis, left for future research, in- 
volves giving that information some persis- 
tence and examining the degree to which 
such persistence will increase the monetary 
authority’s desire for maintaining some de- 
gree of flexibility. Also, given this persis- 
tence, the analysis could consider the possi- 
bility that the monetary authority might 
employ Crawford and Sobel’s (1982) con- 
cept of cheap talk to reveal its private infor- 
mation, at least partially. Specifically, the 
monetary authority could make an impre- 
cise announcement (in terms of a range) 
that partially reveals the persistent (perma- 
nent) component of its private information. 
Accordingly, the monetary authority could 
keep the average equilibrium inflationary 
bias low in absolute terms throughout the 
horizon and, at the same time, enhance the 
scope of flexibility permitted by the target- 
ing procedure. While the optimal targeting 
horizon, N*, would be determined from the 
stationary part of the model, the degree of 
precision of the announcements (in terms of 
the number and width of the ranges) would 
be determined on the basis of the persistent 
component of the monetary authority’s pri- 
vate information. 

Another interesting extension left for fu- 
ture research involves the introduction of 
(verifiable) supply shocks as in Rogoff 
(1985), Flood and Isard (1989), and 
Lohmann (1992) into the model. The richer 
model specification in a multiperiod setting 
would permit a fruitful investigation of the 
merits of the multiperiod targeting proce- 
dure relative to alternative institutional so- 
lutions—in particular, Rogoffs perverse 
policymaker solution, Flood and _ Isard’s 
mixed-strategy solution involving a simple 
constant-money-growth rule with an escape 
clause to permit temporary (i.e., one-period) 
reversions to the full-discretionary policy 
upon the realization of a seldom experienced 
shock, and a combination of those two solu- 
tions as suggested by Lohmann. Moreover, 
the analysis could determine the extent to 
which these alternative institutional ar- 


rangements could be supplemented by the 
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multiperiod targeting procedure to achieve 
a more efficient balance between flexibility 
and credibility. 


APPENDIX 


This appendix sketches a few steps of the 
method used to solve the monetary author- 
ity’s optimization problem subject to the 
N-period average target constraint. (For 
simplicity, the tilde (“ ~”) notation is sup- 
pressed here.) In period t = N, the mone- 
tary authority has no choice. Its policy is 
determined by (11) in the main text, given 
the past sequence {g,}\{'. In period t= 
N —1, it solves the following problem, rec- 
ognizing the impact its policy in the current 
period t = N —1 has on future expectations, 
and subject to (11) in the main text: 


N 
(Al) [-f(g,-8,-7*) 
8N-1 


-8,-y*y]}. 


In light of the constraint on its policy in the 
next period (t = N), the monetary authority 
does not expect that it can create surprise 
inflation in the last period. Since (11) im- 
plies gx = gy, the last term for t=N in 
(A1) is irrelevant in the optimization prob- 
lem. The expectation of the first-order con- 
dition, conditional on d,_, is given by 


(A2) 


N-1 


t=1 
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where (11) has been used. Taking the 
wage-setters’ expectation of (A2) and rear- 
ranging yields 


t=1 


N-2 
3) «) [2 


The solution to the monetary authority’s 
problem in period t = N —1 is obtained by 
substituting (A3) back into (A2): 


1 
(A4) Atf 


where g\,_, is given by (A3). 
In period t = N —2, the monetary author- 
ity solves a similar problem: 


max 
EN \¢ 


[- 


2 
~(g,- 


subject to (11). Under the assumption that 
the monetary authority recognizes the im- 
pact that its current policy has on future 
expectations, the second terms for t= N 
and ¢ = N-—1 in the optimization problem 
for period t = N —2 are irrelevant. Because 
the monetary authority expects to imple- 
ment the policy in (A4) in the next period 
(t= N-—1) and believes that the wage- 
setters’ expectations of its future (t = N —1) 
policy are consistent with (A4), it recognizes 
that it cannot create inflation surprises then 
or any time in the future with its current 
policy. Using (A3), (A4), and (11) and rear- 
ranging, the problem becomes equation 
(A5), below. Since E,_{d,]J=0 and 


/ 


N 


_ 
N 
| 
1+f 
\" 
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Ey_6}=0 for s>N-—2, the first-order 
condition to this problem implies 


N-3 
(A6) ah 


t=1 


and 


2(1+ f) 


(A7) 8y-2= 8y-2+ 2+3f dy_>- 


Similarly, the optimization problem for pe- 
riod t = N —3 has the following solution: 


N-4 
(A8) gy_3=|Na*+3y*/f- 
t=1 
3(1+ f) 


(A9) 8N-3+ N-3: 


Repeating this exercise for periods t= 
N — p, one can verify that the general solu- 
tion to this problem is given by 


N-p-1 


p(i+ f) 
p+(p+1)f 


(A10) &N-p= [vee + py*/f- 


and 
(A11) + 


for p =1,2,3,..., N—1. The solution given 
in the text, equation (12), follows from (A10) 
and (A11), noting that p= N —t. [A proof 
via mathematic induction that (12) is the 
solution is available from the authors upon 
request.] 
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Why Capital Hires Labor: 
A Bargaining Perspective 


By Grecory K. Dow* 


Organizational forms often serve as vehicles for the appropriation of quasi-rent. 
Capitalist firms typically emerge when production requires noncontractible in- 
vestments in specific physical assets, because worker control would divert 
quasi-rents away from asset owners ex post. Conversely, labor-managed firms 
tend to emerge in niches requiring specialized human capital but general-purpose 
physical assets. A key result is that capitalist firms can persist in competitive 
markets even when labor-managed firms would yield a larger total surplus. 


(JEL C78, D23, J54, P13) 


Several decades have passed since Paul 
Samuelson’s (1957) famous remark that in a 
competitive economy it does not matter 
whether capital hires labor or the reverse. 
This claim is surely correct if capital and 
labor are regarded as nothing more than 
abstract inputs in a production function. 
But in reality, the assignment of residual 
claimant status to capital or labor is not a 
matter of indifference. Capital-managed 
firms (KMF’s) are much more common than 
labor-managed firms (LMF’s) in Western 
economies. Except for occasional employee 
buy-outs of capitalist firms, LMF’s tend to 
cluster in professional, craft, or service ac- 
tivities where little specialized physical capi- 
tal is required (Raymond Russell, 1985; 
Avner Ben-Ner, 1988; Henry Hansmann, 
1990). 

Many different reasons have been sug- 
gested for the marginal status of the LMF. 
The early literature emphasized compara- 
tive-static perversities resulting from the (al- 
leged) maximization of net revenue per 
worker (Benjamin Ward, 1958; Evsey Do- 


*Department of Economics, University of Alberta, 
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Rick Szostak, and two anonymous referees provided 
helpful advice on earlier drafts. Financial assistance 
was supplied by the Social Sciences and Humanities 
Research Council of Canada. 
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mar, 1966; Jaroslav Vanek, 1970). Other 
writers have stressed problems of subopti- 
mal investment deriving from the inability 
of current LMF members to capture future 
profits (Eirik Furubotn, 1976; Michael 
Jensen and William Meckling, 1979). These 
defects can all be remedied, at least in 
principle, by making LMF membership 
rights marketable. This institutional design 
motivates LMF members to maximize profit 
or present value rather than income per 
worker. LMF’s thus become isomorphic to 
capitalist firms in all behavioral respects 
(Murat Sertel, 1982, 1991; Dow, 1986, 1992, 
1993a; Ernst Fehr, 1993). In light of this 
analytic result, the rarity of LMF’s in mar- 
ket economies remains puzzling.' 

The comparison between capitalist and 
labor-managed firms is a _ special but 
paradigmatic case of a broader question: 
given free entry, why do some organiza- 
tional forms survive in equilibrium while 
others do not? One plausible hypothesis is 
that surviving modes of organization yield a 
larger total surplus than potential rivals and, 
therefore, prove more attractive to factor 


‘Other explanations for the dominance of the capi- 
talist firm appeal to informational asymmetries. The 
classic reference on this subject is Armen Alchian and 
Harold Demsetz (1972); for a critical review see Louis 
Putterman (1984). Recent authors have particularly 
stressed the role of moral hazard between lenders and 
borrowers (Henry Hansmann, 1988; Mukesh Eswaran 
and Ashok Kotwal, 1989; Herbert Gintis, 1989). 
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suppliers. This is essentially the explanation 
offered by transaction-cost economics 
(Oliver Williamson, 1985). 

In fact, there is no necessary relationship 
between the size of the surplus generated 
by an organizational form and its long-run 
viability. To see why, suppose that produc- 
tion requires an initial investment in a spe- 
cialized asset, and assume that this invest- 
ment is noncontractible (perhaps it is too 
costly to write down all technologically rele- 
vant characteristics of the asset). Now con- 
sider two possible organizational forms, A 
and B. Firms of type A maximize aggregate 
surplus once the specialized asset is in place 
but make it difficult or impossible for 
owners of the asset to capture quasi-rent 
ex post. Firms of type B yield a smaller 
aggregate surplus but direct most of this 
surplus to the asset’s owners. Since type-A 
firms are unattractive to potential investors, 
organizers of such firms are disadvantaged 
relative to type-B rivals. This liability would 
disappear if the prospective quasi-rent re- 
cipients in type-A firms could bribe in- 
vestors to participate. However, by assump- 
tion, one cannot write contracts under which 
the asset is supplied in exchange for side 
payments. Given this constraint, the ease 
with which asset owners can appropriate 
quasi-rent in a type-B firm proves decisive. 
Other things equal, firms of this type will 
prevail in equilibrium because they offer 
higher returns to investors.’ 

The application to labor-managed firms is 
straightforward. Suppose that the specific 
asset is a factory and workers cannot ac- 
quire a factory directly because collective 
capital ownership is impractical. LMF mem- 
bers could nevertheless attempt to lease a 
factory from external owners. In their man- 
agerial capacity the LMF workers would 
unilaterally choose output levels, working 
conditions, and the intensity of equipment 
use. These decision rights could enable 


2A similar logic operates in models in which ex post 
diversion of quasi-rent to unionized workers depresses 
investment in the capitalist firm (Kim Clark, 1984; Paul 
Grout, 1984; John Addison and Barry Hirsch, 1989). 
This effect would vanish if workers and owners could 
contract on investment levels. 
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workers to transfer substantial quasi-rent to 
themselves, at the expense of the outside 
owners. Investors who foresee this problem 
will be reluctant to build a factory in the 
first place. One possible solution is for the 
LMF to extend ex ante bribes to potential 
investors in exchange for construction of the 
factory. However, workers might not be able 
to write down all desired characteristics of 
the factory, or the courts might be unable to 
verify whether contractual specifications 
were met. An alternative solution is to let 
the factory owners manage the firm them- 
selves and capture quasi-rents as residual 
claimants. While transferring decision rights 
from workers to owners might reduce total 
surplus, the factory would at least be built.° 

Parallel appropriability dilemmas crop up 
whenever specific investments are noncon- 
tractible and the firm’s decision-making 
structure affects the distribution of ex post 
quasi-rent. Due to the noncontractibility 
problem, ail investors must be willing to 
participate even in the absence of side pay- 
ments. Organizations in which this ex ante 
participation constraint is violated will not 
flourish in the long run, regardless of their 
potential ex post productivity (Dow, 1993b). 

In this paper I will sketch out one mecha- 
nism through which the locus of authority 
within the firm could influence quasi-rent 
distribution and thus the viability of alterna- 
tive organizational forms. The preceding ar- 
gument about appropriability problems is 
quite genera! and does not depend upon the 
empirical validity of the particular mecha- 
nism discussed here, which involves bilat- 
eral bargaining between capitalists and 
workers. The bargaining model is of some 
interest, however, because it identifies cir- 
cumstances under which equilibrium organi- 
zations fail to maximize aggregate wealth. It 
is also consistent with the observed distribu- 
tion of KMF’s and LMP’s across industries. 


‘After the factory has been built, workers might try 
to bribe its owners to adopt the LMF structure if this 
would increase aggregate surplus. However, once the 
firm is reorganized as an LMF the external asset own- 
ers will have little incentive to maintain the firm’s 
capital stock (see Section V). 
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I. Bargaining and Organizational Form 


The model has the following structure. In 
the KMF, owners of physical assets are the 
residual claimants, hire workers, and choose 
output levels. These roles are reversed in 
the LMF. Once specialized assets are in 
place, the hired input bargains over its own 
remuneration in each organizational form 
(the wage in the KMF, the rental fee paid 
to asset owners in the LMF). At the end of 
this bargaining procedure, the residual 
claimant unilaterally selects an output. The 
output chosen depends on the compensa- 
tion paid to the hired input through a par- 
ticipation constraint. 

I then show that there are equilibrium 
wage bargains in which the KMF does not 
maximize total surplus, but owners and 
workers both obtain positive quasi-rent 
shares. The corresponding LMF bargaining 
process induces surplus maximization at the 
production stage, but all of the resulting 
quasi-rent is appropriated by the firm’s work 
force. The rest of the argument proceeds as 
before. If physical assets are specialized and 
noncontractible, the KMF prevails because 
it allows capital suppliers to recoup their 
sunk setup costs, even though total surplus 
is greater in the LMF. The converse also 
holds: when physical assets are general- 
purpose but it is impossible to shift the cost 
of investments in specific human capital to 
employers, LMF’s can prevail in equilib- 
rium even though they fail to maximize ag- 
gregate surplus. 

Explanations of the capitalist firm which 
rely on the notion of bargaining power are 
frequently regarded as analytically loose and 
faintly disreputable (Williamson, 1980; 
David Landes, 1986; Eswaran and Kotwal, 
1989). A brief defense of this approach is 
therefore in order.t Modern bargaining 
models rigorously derive the relative power 


*For related applications of bargaining theory to 
questions of organizational form, see Dow (1985, 1988, 
1989). 
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of agents from their rates of time prefer- 
ence or attitudes toward risk (Martin Os- 
borne and Ariel Rubinstein, 1990). For ex- 
ample, workers might place less weight on 
future income than capitalists because they 
are rationed in the credit market (Joseph 
Stiglitz and Andrew Weiss, 1981). If so, they 
will be handicapped in wage-bargaining. 
Similarly, workers who cannot diversify their 
wealth holdings might be more risk-averse 
than capitalist investors and hence more 
fearful of a breakdown in negotiations (Jan 
Svejnar, 1986). Any such reluctance to bear 
risk will also undermine the bargaining po- 
sition of workers. 

Transaction-cost economics and the bar- 
gaining approach share an emphasis upon 
quasi-rent appropriation. However, they op- 
erate at different levels of analysis. In the 
view of transaction-cost theorists, organiza- 
tional forms economize on total production 
and transaction costs (Williamson, 1985). 
Consequently, the central optimization 
problem is formulated at the social rather 
than the individual level. The noncoopera- 
tive bargaining perspective, by contrast, re- 
jects axioms of collective rationality in favor 
of a commitment to individual rationality 
(Rubinstein, 1987). In this approach the firm 
is a nexus of strategies rather than a nexus 
of contracts. Organizational forms are 
shaped by strategic behavior on the part of 
input suppliers, not just an overarching ob- 
jective of wealth-maximization (Dow, 1987, 
1993b; Williamson, 1987). 

Apart from the assumption of noncon- 
tractible investment, I will need to intro- 
duce some further contracting frictions in 
the course of the bargaining analysis. 


(a) Output is noncontractible.—Without this 
assumption, factor suppliers would al- 
ways choose output to maximize surplus 
and then bargain over the division of 
the resulting quasi-rent. For similar rea- 
sons, it is necessary to rule out contracts 
in which all output-related costs borne 
by hired inputs are transferred to the 
residual claimant. I therefore assume 
that payments to hired inputs are invari- 
ant with respect to output (e.g., no piece 
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rates). This restriction is appropriate 
when the courts cannot verify output 
quality, for example. As will be ex- 
plained in Section II, the key analytic 
results also go through when the resid- 
ual claimant exercises authority over a 
noncontractible variable other than out- 
put, provided that suppliers of the hired 
input incur an associated cost which 
cannot be shifted to the residual 
claimant. 

(b) Hired inputs cannot promise to forgo bar- 
gaining in exchange for side payments.— 
Section II will show that total surplus 
varies with the wage. This effect would 
be of no interest, however, if input sup- 
pliers could simply adopt the wage that 
maximizes overall surplus and then di- 
vide the resulting pie through side pay- 
ments. I exclude this scenario by assum- 
ing that if owners bribe workers to ac- 
cept such deals, workers can simply 
pocket the bribe and restart negotia- 
tions without penalty. Under these con- 
ditions it is impossible for agents to 
avoid the allocative consequences of 
wage-bargaining. By the same token, | 
will assume that LMF’s cannot bribe 
owners of physical assets to refrain from 
opportunistic bargaining over the fee 
charged for the use of their assets. 
Capital and labor supply cannot be inte- 
grated.—Bargaining inefficiencies be- 
tween buyers and sellers are often 
avoided through the device of vertical 
integration (Benjamin Klein et al., 1978; 
Williamson, 1985; Sanford Grossman 
and Oliver Hart, 1986; Hart and John 
Moore, 1990). Similar inefficiencies 
could be avoided here either by having 
capitalists supply labor to their own firms 
(Eswaran and Kotwal, 1989) or by hav- 
ing the firm’s work force collectively ac- 
quire its physical assets (Avner Ben-Ner 
and Byoung Jun, 1990). In what follows, 
I assume that the first of these options 
is excluded by the scale of production or 
the owner’s demand for leisure. Worker 
takeovers are ruled out by liquidity con- 
straints, informational asymmetries, or 
problems associated with collective cap- 
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ital ownership.° As a result, capital and 
labor are supplied by distinct agents. 


Sections II and III develop the bargaining 
analysis for the KMF. Section II derives the 
optimal KMF output at an exogenously 
given wage, assuming that asset owners 
choose output subject to a worker-participa- 
tion constraint. Section II] endogenizes the 
wage using a Rubinstein alternating-offers 
bargaining framework. With reasonable re- 
strictions on the bargaining strategies, I 
show that two perfect equilibria can arise. 
The first is the standard Rubinstein solu- 
tion, in which quasi-rent is divided accord- 
ing to the time preferences of the factor 
suppliers. Output is excessive in this equi- 
librium because the firm’s owners fail to 
internalize effort costs borne by workers. In 
the other equilibrium, output is optimal, 
workers receive their reservation payoffs, 
and all quasi-rent flows to the owners of the 
firm. A dominance result establishes that 
the Rubinstein solution should be selected 
when both equilibria exist. 

Section IV exploits the symmetry of the 
KMF and LMF in order to characterize 
market equilibrium. Assuming that invest- 
ments in specialized assets are noncon- 
tractible, residual claimant status usually 
goes to the suppliers of such inputs. Equi- 
librium modes of organization may maxi- 
mize total surplus; but equilibrium can also 
involve excessive output by each firm, too 
little entry, and a diversion of quasi-rent to 
hired inputs. In particular, KMF’s can pre- 
vail even when LMF’s would generate a 
larger total surplus. Section V_ discusses 
some contractual devices that might pro- 
mote the emergence of wealth-maximizing 
organizational forms. 


>I show elsewhere (Dow, 1993a) that LMF’s with 
collective capital ownership can generate a larger static 
and dynamic surplus than KMF’s, while nevertheless 
growing more slowly and providing a lower private 
value per unit of capital. 


122 THE AMERICAN ECONOMIC REVIEW 


II. Output Decisions in the Capitalist Firm 


Call the factor suppliers in the capitalist 
firm K and L. These labels denote coalitions 
of individual agents: for example, share- 
holders and a labor union. I abstract from 
issues of input substitution by assuming a 
Leontief production technology. Labor is 
supplied to the firm with infinite elasticity 
ex ante, and the reservation utility of work- 
ers is normalized at zero. Once K has re- 
cruited L, however, some combination of 
hiring and firing costs leads to a condition 
of bilateral monopoly (Robert Solow, 1985; 
Assar Lindbeck and Dennis Snower, 1988).° 

At the bargaining stage, K and L negoti- 
ate the fixed wage L will receive when pro- 
duction occurs. K then chooses an output 
unilaterally at the production stage, subject 
to the constraint that L’s payoff be nonneg- 
ative.’ K’s payoff as the residual claimant is 
revenue minus depreciation on the firm’s 
capital stock and the wage paid to L. L’s 
payoff as a hired input is the wage minus 
the disutility of effort. Overall surplus de- 
pends on K’s choice at the production stage 
because depreciation costs and the disutility 
of effort vary with the output of the firm. 

Since K does not directly internalize ef- 
fort costs, there is an incentive for overpro- 
duction. If this tendency is restrained at all, 
it is only by L’s participation constraint. 
When the wage is high enough, the partici- 
pation constraint is irrelevant because L has 
a positive payoff even at K’s preferred out- 
put. Firm output is consequently excessive. 
At lower wage levels, the constraint binds, 
and output is smaller. Through a suitable 
wage choice, K can be induced to adopt the 
surplus-maximizing output level. However, 
L’s payoff must be zero if surplus is maxi- 


°Even when L supplies only unskilled labor, K is 
effectively locked into a bilateral monopoly position if 
it is costly to replace L ex post, and K’s existing 
bargaining position is precisely replicated after a re- 
placement work force has been recruited (Gil Skillman 
and Harl Ryder, 1988). 

If output were chosen prior to (or jointly with) the 
wage, the wage would merely divide a fixed total sur- 
plus between K and L. The resulting wage bargain 
would depend upon output, an implicit violation of 
assumption (a) in Section I. 
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mized because the labor-participation con- 
straint always binds in this situation. 

The use of output as the residual 
claimant’s choice variable is suggestive, but 
not essential to the analysis. The key point 
is that the residual claimant exercises man- 
agerial authority over some noncontractible 
variable (perhaps production technique, or 
working conditions) which affects costs 
borne by the hired input. If these costs can 
only be internalized by the residual claimant 
through a binding participation constraint, 
then maximization of total surplus necessar- 
ily requires that the hired input receive no 
quasi-rent. The bargaining analysis of Sec- 
tion III applies with minor modifications to 
any model of this general type. 

The utility functions at the production 
stage are 


(1) ux(4.w) = pq-cx(q)-w 


where q > 0 is output and w > 0 is the wage. 
The output price p is parametric for an 
individual firm but will be endogenized when 
market equilibrium is discussed in Section 
IV. The function c,(-) captures deprecia- 
tion on physical assets, while c,(-) is the 
disutility of work effort. 

The sum of these payoffs is given by. the 
surplus function 


(2) 
=Ux(q,w)+u,(q,w). 


This is the total quasi-rent available to K 
and L. In a world of complete contracts the 
firm would choose output to maximize total 
surplus, treating c,(q)+c,(q) as its cost 
function. I depart from this framework by 
assuming that the costs cy, and c,; must be 
borne privately by the respective input sup- 
pliers. A capitalist firm cannot arrange con- 
tracts under which the firm’s owners bear 
the disutility of effort, and a labor-managed 
firm cannot sign contracts with external as- 
set owners under which its membership 
bears depreciation costs. 


3 
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The cost functions are twice continuously 
differentiable and satisfy the following tech- 
nical requirements: 


ASSUMPTION 1: 
cx(0) =c,(0) =0 
ASSUMPTION 2: 
= c4(0) = 0 
ASSUMPTION 3: 
ci(q) > 0, > 0 


alla>0 i=K,L 


ASSUMPTION 4: 


lim c,(q) = lim 


The surplus function is shown in Figure 1. 
Surplus is strictly concave in output, rising 
from zero to a maximum at q* and then 
returning to zero at qy. At larger outputs, 
surplus is negative. 

Taking the wage as given at the produc- 
tion stage, K chooses output unilaterally to 
solve 


max {pq ¢x(4) w} 


subject to c,(q)<w. At high wage levels, 
the participation constraint does not bind. 
The resulting unconstrained optimum will 
be denoted by Gy. This output level is ex- 
cessive because K ignores effort costs. At 
low wage levels, gy, is infeasible, and the 
constraint binds. Writing K’s output choice 
as q,(w), one therefore has 


(3) ax(w)=cr'(w) for0<w<w 


= 4x for W<w 

where W =c,(q4,) is the lowest wage rate at 
which L is willing to produce the output qx. 
Since c, is monotonic, wage increases on 
the interval [0,w] relax the participation 
constraint and raise output. This effect van- 
ishes once G, becomes feasible. 
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s(q) 


FiGuRE 1. SURPLUS AS A FUNCTION OF OUTPUT 


Substituting (3) back into (1), one obtains 
indirect utility functions which depend solely 
on the wage: 


(4) 


=u 


There are two cases to consider, depending 
on whether or not the participation con- 
straint is binding: 


(5) if0<w<w,then 


Uy(w) =0 


if <w, then UK(W)=Sk+W-w 


where | adopt the notational shorthand s, 
= s(qx). 

The functions vg, and v, are shown in 
Figure 2. In each part of the figure, I distin- 
guish two different situations. To the left of 
w, L’s payoff is zero since the participation 
constraint binds. K therefore appropriates 
the full surplus s[q,(w)]. This surplus at- 
tains a maximum of s* at the wage w* 
which induces K to choose the optimal out- 
put q*. Further wage increases will raise 


123 
= s(q") 

q 

q* qo 
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A. 
Vie 
vi(w) 
0 
w W Wo 
vK(W) 
W = [0,Wo] 
B. 
Vie Vb 
vy (w) 
s* = §(q*)} --------= 
0 
Sq = 


FiGcure 2. INpirEct UTILiry FUNCTIONS FOR THE KMF 
A) WHEN Sx > 0 AND B) WHEN Sx <0 


output and reduce surplus until w is 
reached. 

To the right of w, the participation con- 
straint for labor is irrelevant. K’s preferred 
output choice gy can be implemented, and 
total surplus is constant at s,. Variations in 
the wage in this region serve only to redis- 
tribute the fixed surplus between K and L. 
To the right of wy, K’s payoff becomes 
negative. 

Figures 2A and 2B differ with respect to 
the sign of the surplus s,. When K has only 
a mild incentive to overproduce (4, <q), 
the surplus s, is positive as in Figure 2A. 
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In this case, there are wage bargains in 
which K and L both have strictly positive 
payoffs. L has nothing to lose by insisting 
on such a wage, because the alternative 
is a payoff of zero. However, any wage 
high enough to give L a positive payoff 
necessarily induces K to choose the exces- 
sive output Gx. 

If K’s incentive to overproduce is stronger 
(4x >), the surplus s, is negative as in 
Figure 2B. In this case the labor-participa- 
tion constraint binds throughout the rele- 
vant interval [0,w,], and L cannot obtain a 
positive payoff at the bargaining stage. Since 
L is indifferent and K would like to maxi- 
mize total surplus, it is plausible to expect 
an agreement on the wage w*. With this 
bargaining outcome, the firm adopts the 
correct output qg*, and K appropriates the 
entire (maximized) surplus s*. 


III. Wage Bargaining in the Capitalist Firm 


I now formalize these ideas about bar- 
gaining outcomes. It is evident from Figure 
2 that K and L would like to adopt the wage 
w* and then negotiate over the distribution 
of the surplus s*. However, this is infeasible 
by assumption (b) in Section I. We there- 
fore need an explicit model of the proce- 
dure through which wages are determined. 

I will initially assume s, > 0 as in Figure 
2A, since this situation raises more complex 
bargaining issues. (The case s, <0 is dis- 
cussed toward the end of this section.) When 
Sx > 0, the set of utility pairs generated by 
wage Offers in the interval W =[0,w,] is 
nonconvex, implying that the Nash bargain- 
ing solution cannot be used. I turn instead 
to an alternating-offers game of the Rubin- 
stein (1982) type, where K opens with some 
proposal w€@W which L either accepts or 
rejects. If L rejects this offer, L makes a 
counteroffer in the following period, which 
K accepts or rejects, and so on. If this 
process ends with an acceptance, the ac- 
cepted wage becomes binding, and produc- 
tion occurs. Otherwise each agent receives a 
zero payoft. 

The analysis is straightforward if wage 
offers are confined to the interval [W,w,] in 
Figure 2A, since then a pie of fixed size is 


) 
W = [0,Wo] 
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being divided. It is well known that the 
Rubinstein game has a unique perfect equi- 
librium in this case. However, when offers 
throughout the interval W =[0,w,] are ad- 
missible, total surplus varies, and neither 
utility function is strictly monotonic in the 
wage. This environment generates many 
perfect equilibria, some of which give L a 
zero payoff. 

There is nevertheless a strong intuition 
that wage bargains should lie in the range 
(W,w,) where both payoffs are positive. Any 
bargaining strategy in which L accepts a 
zero payoff (e.g., the wage w*) is dominated 
by an otherwise identical strategy where L 
rejects such proposals and insists instead on 
a positive payoff. If L does hold out for a 
positive payoff, K is better off with an 
agreement in the range (W,w,) than with 
perpetual deadlock. 

We formalize this intuition in the follow- 
ing way. Let the one-period discount fact- 
ors associated with the Rubinstein game be 
5, €(0,1) and 6, €(0,1). In addition to the 
conventional requirement of perfection, we 
impose two further restrictions on the bar- 
gaining strategies of the players. 


ASSUMPTION 5: The equilibrium bargain- 
ing strategies are stationary. 


ASSUMPTION 6: In cases of indifference, 
existing proposals are accepted. 


Assumption 5 means that each agent’s ac- 
tions are independent of events in earlier 
bargaining periods. A stationary strategy o, 
= (wx, Z,) is characterized by a wage offer 
Wx which K always proposes, and a set 
Zx © W of wage offers K is always prepared 
to accept. A stationary strategy for L is 
written ao, =(w,,2Z,). The simplicity of such 
strategies provides an appealing basis for 
narrowing the set of possible equilibria. As- 
sumption 6 ensures that an agent accepts a 
proposal when there is no hope of doing 


‘Raquel Fernandez and Jacob Glazer (1991) estab- 
lish that when a union has the option of striking, some 
perfect equilibria are inefficient. However, these 
Pareto-dominated bargaining outcomes involve non- 
stationary strategies. 
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any better by prolonging the bargaining pro- 
cess.” 


PROPOSITION 1: The only subgame- 
perfect equilibria in which Assumptions 5 and 
6 both hold are the following: 


(i) Extended Rubinstein Sqlution (ERS): wx, 
W,, Zx, and Z, satisfy 


(1—6,)W+6,(1— 6x) 


dx(1— 6,)W +(1— dx) 


Wy 


Zx= {w EW: v,(w)= 5xUx(Wx)} 


Z, =[wx,Wo]. 


(ii) Residual Claimant Optimum (RCO): wx, 
w,,Zx, and Z, satisfy 


WK=w* 

w, is an arbitrary element of [0,w,| 
{we W: vyg(w) > 

Z, =[0,»]. 


The RCO strategies form a perfect equilib- 
rium if (and only if) 5ys* = Sx. 


(The proof is available from the author upon 
request.) 

The ERS equilibrium is identical to the 
usual Rubinstein solution, but it extends 
this solution by permitting any wage offer in 
the full interval W =[0,w,] rather than just 
the truncated set [W,w,] where total surplus 
is constant. The ERS wage is w, if K makes 
the first offer and w, otherwise. The key 


‘These conditions do not by themselves rule out a 
zero payoff for L. Such outcomes occur in the RCO 
equilibrium described by Proposition 1. To establish 
that L’s payoff is positive, it is necessary to impose the 
further condition that the equilibrium strategies are 
undominated. Proposition 2 can then be used to elimi- 
nate equilibria in which L receives a zero payoff. 


| 


126 THE AMERICAN ECONOMIC REVIEW 


feature of this equilibrium is that w, and 
w, both lie in the interior of the truncated 
set [W,w,] even though this restriction 
was not imposed a priori. Because K fails to 
internalize the effort cost borne by L, out- 
put is excessive (G, > q*), and total surplus 
falls short of the achievable maximum (s, 

In an equilibrium of the RCO type, L 
accepts any wage offer from K. Accordingly, 
K demands the wage w* and only accepts 
wage proposals that are at least as attractive 
as the maximum surplus s* received with a 
one-period delay. L’s own wage proposal w, 
is arbitrary because L has no reason to 
make acceptable offers: any wage accept- 
able to K has zero utility for L. 

I turn next to the problem of choosing 
between the ERS and RCO outcomes. A 
dominance result indicates that the ERS 
solution should be chosen and thereby rati- 
fies the earlier intuition that the wage 
agreement should give both sides a positive 
payoff. 


Definition: The bargaining strategy o, is 
dominated if there exists some other strat- 
egy 0% such that 


(a) against any strategy o, K’s payoff on 
every subgame from the use of oy is at 
least as large as the payoff from o,; and 

(b) there exists some a, and some subgame 
such that K’s payoff from o, is greater 
than the payoff from ox. 


Otherwise, o, is undominated. 


The strategies in this definition need not be 
stationary. All subscripts are reversed to 
define dominance in the case of L. 


PROPOSITION 2: The strategies of K and 
L in the ERS equilibrium are undominated, 
but L’s strategy in any RCO equilibrium is 
dominated. 


(The proof is available from the author upon 
request.) Since bargainers can be expected 
to discard dominated strategies, this result 
shows that the ERS equilibrium should be 
selected. 


MARCH 1993 


Assuming that K makes the first wage 
proposal,'” one can write the ERS wage 
bargain more compactly as 


(6) Wx =aw +(1-—a)wo 


where a=(1—6,)/(— 6,6, ). The power 
parameter a reflects the time preferences 
of the two parties.'! As usually happens in 
Rubinstein games, more patient agents en- 
joy a bargaining advantage (w, is decreas- 
ing in 6, and increasing in 6, ). Upon sub- 
stituting the wage bargain from (6) back 
into (5), one obtains the equilibrium KMF 
payofts: 


(7a) 


ifs,>0,then 
vKMF = (1—a@)5,. 


Finally, I return to the situation where 
K’s overproduction incentive is strong (s, < 
0) and Figure 2B applies. Here the bargain- 
ing problem disappears, because L’s payoff 
is identically zero on the interval [0,w]. 
Intuition suggests the outcome w*. More 
formally, since no ERS equilibrium exists 
for the case shown in Figure 2B, Proposi- 
tion 2 is irrelevant, and any equilibrium 
must be of the RCO variety. Proposition 1 
indicates that as the time between wage 
offers approaches zero (6, approaches 
unity), K’s acceptance set Z, shrinks to w*. 
If L makes proposals that are acceptable to 
K when L is indifferent, in the limit w* is 
the unique equilibrium bargaining outcome. 


The asymmetry introduced by designating K as the 


first mover vanishes when the time between bargaining 
proposals goes to zero because w, and w, approach a 
common limit. Write 5, =e and 6, =e 
where px and p, are continuous-time discount rates 
and A is the time between proposals. Then, 


lim wy(A)= lim w,(A)=aw+(1-a)wo 
4-0 A-0 


where a= /(px + py). 

"The parameter a could also reflect the risk atti- 
tudes of K and L (Ken Binmore et al., 1986; Osborne 
and Rubinstein, 1990 Ch. 4). 
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It follows that 


(7b) ifs, <0, then vkMF= 


— 


In this case output is optimal, and K appro- 
priates all quasi-rent. 

There is a sharp discontinuity in the bar- 
gaining outcome at the transition from s, 
> 0 to <0: wages drop abruptly from w 
to w*, output drops from K’s preferred level 
dx to the surplus-maximizing level g*, and 
total surplus jumps from s, to s*. The 
reason is that a nonconvex utility possibility 
set is replaced by a convex one at s, =0. 
Indeed, the set of feasible utility pairs is 
degenerate for s, <0 since only K can ob- 
tain a positive payoff in this case. 

I now pause to summarize the main 
implications derived from the bargaining 
analysis. When workers cannot obtain a 
positive payoff under any wage agreement 
(sx <0), K imposes a wage that supports 
the optimal output level q* (through the 
RCO bargaining equilibrium). If there are 
some wage agreements that would give 
workers positive quasi-rent (s, > 0), then 
such an agreement is actually adopted 
(through the ERS bargaining equilibrium). 
In the latter case, the labor-participation 
constraint is irrelevant, and K chooses the 
excessive output level gx. 

One empirical check on the model fol- 
lows from the prediction that workers should 
appropriate a share of the capitalist firm’s 
quasi-rent whenever output is excessive. 
Econometric evidence indicates that work- 
ers often do capture substantial rents. 
This finding can of course be explained 
in a number of different ways. Efficiency- 
wage theorists view labor rents as a profit- 
maximizing response by employers to prob- 
lems of turnover, adverse selection, or costly 
supervision (Lawrence Katz, 1986; William 
Dickens and Katz, 1987; Alan Krueger and 
Lawrence Summers, 1988). Others have ar- 
gued that workers capture part of the excess 
profit resulting from product-market imper- 
fections (Thomas Pugel, 1980; Clark, 1984; 
Michael Salinger, 1984). While one cannot 
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say with confidence that bargaining pro- 
cesses are responsible, such evidence for 
the existence of labor rents is at least com- 
patible with the bargaining model devel- 
oped here. 


IV. Equilibrium Organizational Forms 


I can now return to the question posed in 
the Introduction: what determines the equi- 
librium mode of organization? First, a brief 
discussion of the LMF is needed. In firms of 
this type, labor is the residual claimant, and 
capital is the hired input. Due to this rever- 
sal in roles, the payoff functions from (1) 
are replaced by 


where r is the rental fee paid by L for the 
use of K’s physical assets. Since K and L 
cannot contract directly on the intensity with 
which assets are used, r is a lump-sum 
transfer which does not vary with output. 
This restriction on LMF’s is motivated by 
Klein et al. (1978), Jensen and Meckling 
(1979), and Putterman (1988), who empha- 
size the cost to absentee owners of monitor- 
ing the use or abuse of physical assets. 

Due to the symmetry of the KMF and 
LMF, the analysis from Sections II and II 
goes through for the LMF with an inter- 
change of subscripts. Since the surplus func- 
tion s(q) defined in (2) does not depend on 
organizational form, the surplus-maximizing 
output g* remains the same. The uncon- 
strained capitalist output (q¢,) is replaced 
for the LMF by the unconstrained worker 
output (g,) which maximizes the payoff 
function u, in (1’). 

Let s, =s(q,) be the surplus generated 
by labor’s preferred output. The equilib- 
rium payoffs and vtMF from bargain- 
ing over the capitalist’s rental fee are sym- 
metric to those listed in (7) for the KMF. 
For easy reference, | state the payoffs from 
each organizational form in Table 1. To 
highlight the fact that output choices de- 
pend on market prices, I henceforth denote 
the surplus-maximizing output, the uncon- 


: 
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TABLE 1—Gross PAYOFFS TO FACTOR SUPPLIERS 
FROM THE KMF anp LMF 


KMF 
Sx(p)> 0 Sx(p) <0 
p): asy(p) s*(p) 
Op): a@)sy(p) 0 
LMF 
s(p)>0 s,(p) <0 
viMF( p): as;(p) 0 
yp p): (1—a)s,(p) s*(p) 
strained KMF output, and the uncon- 


strained LMF output by q*(p), G,(p), and 
G,(p). The surplus levels corresponding to 
these outputs are represented by s*(p), 
5x(p), and s,(p) in Table 1. 

In order for quasi-rents to arise in equi- 
librium, at least one factor supplier must 
incur a sunk setup cost prior to the bargain- 
ing stage. These costs will be denoted by 
yx 2Oand y, >0.'° For instance, y, could 
be the sunk cost associated with a factory, 
while y, could be an expenditure on spe- 
cialized worker training. I assume that these 
expenditures cannot be transferred from one 
factor supplier to another: using the termi- 
nology of the Introduction, such invest- 
ments are noncontractible.'* This assump- 
tion will be discussed at greater length in 
Section V. 


'21f the time between bargaining offers is short, the 
identity of the first mover is immaterial, and the power 
parameter a is identical in the KMF and LMF (see 
footnote 10). 

ignore the case yx =y,_=09, since then free 
entry drives price to zero in the product market. 

14For simplicity I assume that specialized assets are 
lumpy. The qualitative features of market equilibrium 
would be the same if investments were modeled con- 
tinuously, since hired inputs never get positive returns 
when surplus is maximized. Surplus maximization is 
thus incompatible with any positive investment on the 
part of hired inputs (see the remarks on Table 2 in the 
text). This conclusion is peculiar to the bargaining 
analysis of Section III. In a different modeling frame- 
work, each input might capture positive quasi-rent in 
each type of firm. The quantitative size of the invest- 
ments elicited by the KMF and LMF would then need 
to be examined (an illustrative approach is that of Hart 
and Moore [1990]). 
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A viable organizational form must give 
both factor suppliers nonnegative ex ante 
payoffs, so that entrepreneurs who establish 
such firms can attract all required inputs. 


Definition: The KMF is viable at a price p 
if both factor suppliers have nonnegative 
net payoffs at this price; that is, veMF( p) > 
yx and vS™F(p)> y,. LMF viability is de- 
fined analogously. 


Viability means that all ex ante participa- 
tion constraints are satisfied simultaneously 
at the going output price. This idea moti- 
vates the following definition of market 
equilibrium. 


Definition: Let py, be the lowest price at 
which the KMEF is viable, and let p, be the 
lowest price at which the LMF is viable. 
Then the KMF is the equilibrium organiza- 
tional form if py < p,. Conversely, the LMF 
is the equilibrium form if p,; < py. If px = 
p,, then the KMF and LMF can coexist in 
equilibrium. 


Entry by viable organizational forms places 
downward pressure on price. The preceding 
definition defines equilibrium organiza- 
tional forms to be those which survive this 
competitive pressure in the long run. 
Before describing market equilibrium in 
more detail, I add a further requirement to 
Assumptions 1-4 from Section II. 


ASSUMPTION 7: The functions s,(p) and 
S,(p) are strictly monotonic. 


Since s,(0)=s5,(0)=0, Assumption 7 im- 
plies that the functions s, and s, never 
change sign. I can therefore refer to the 
signs of s, and s, unambiguously without 
having to specify the particular output price 
at which these signs are determined.'° This 
facilitates an application of the bargaining 
results derived in Section III. 


'SAssumption 7 holds whenever the cost functions 
Cx and c, are quadratic (other special features of the 
quadratic case are noted at the end of Section IV). The 
maximum surplus s*(p) is always strictly increasing by 
the envelope theorem. 


3 
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TABLE 2—MARKET EQUILIBRIUM 


Equilibrium 
Case organization 


Output Equilibrium 
per firm price 


A. Sx <0, 5, <0 
l. yx > 0, yy =0 KMF 
2. ¥K =0, >0 LMF 
3. yx > 0, y, > 0 none 


> 0, 5, <0 
l. yp =O 
2. >0 


LMF 
KMF Sx 
SK <0, 5, >0 
l. y, =0 

2. y, >0 


KMF 
LMF 


S*( pK J=YVxK 
D. sx > 0,5, >0 


lL. py 
2. PK> Pi 


KMF* 


Sx (py) =S no 
LMF? 


no 


PK) 
Dy) 


Notes: Outputs denoted by g* maximize total surplus at the equilibrium price. 
Outputs denoted by g indicate overproduction. For each set of conditions, the 
equilibrium price satisfies the given equation. 

The surplus level 5 is defined by 5=max{y, /a, y,; /—a)}. In the “optimal 
entry?” column, “yes” means that aggregate surplus cannot be increased through 
further entry. In such cases, no input supplier receives a rent. A “no” means that 
entry is suboptimal if and only if yy /a#y, /U—a@). If yp /a> @), then L 
receives a rent; if this inequality is reversed, then K receives a rent. 

“When the KMF is an equilibrium in this case, sx(p,)> 5,(px) so that the KMF 
surplus is larger at the equilibrium price px. 

”When the LMF is an equilibrium in this case, s,;(p, ) 


> Sx(p,) so that the LMF 
surplus is larger at the equilibrium price p,. 


The following proposition characterizes 
the minimum price at which each organiza- 
tional form is viable. 


PROPOSITION 3: Suppose that i= K, L is 
the residual claimant and that Assumptions 
1-4 and Assumption 7 hold. The lowest price 
p; for which the iMF is viable is determined 
as follows. 


(a) Suppose s;<0. Let j be the hired input. 
Then (i) when y; > 0, the iMF is never viable; 
but (ii) when y,=0, the price p, satisfies 
s*( p;) = y;. 


(b) Suppose s; > 0. Then p; satisfies 


2 Yk 
S,( p;) = 5 = max; —, 
a 


PROOF: 

(a) From Table 1, the residual claimant’s 
net payoff is s*( p)— y;, and the hired input’s 
net payoff is —y;. If y;>0, the iMF is 


nonviable because the hired input’s net pay- 
off is always negative. However, y; = 0 im- 
plies that the hired input’s net payoff is 
always zero. By the monotonicity of s*(p), 
the lowest price p, that gives the residual 
claimant a nonnegative payoff is determined 
by s*(p,)= y;. 

(b) From Table 1, K’s net payoff in the 
iMF is as{p)— yx, and L’s net payoff in 
the iMF is (1—a@)s(p)— y,. Since s,(p) is 
monotonic, the lowest price at which both 
net payoffs are nonnegative is given by the 
equation stated in part (b). 


Table 2 characterizes the market equilib- 
rium corresponding to each sign combina- 
tion for s, and s,. Since the results follow 
directly from Proposition 3, formal proofs 
are omitted. Several generalizations emerge 
from Table 2. 


(a) When s, <0, assigning residual claims 
and managerial rights to input 7 leads to 


129 
entry? 
q*(p, s*( pe) = ¥x yes 
q*( py) = yes 
B 
yes 
no 
C 
yes 
no 
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surplus-maximizing output choices. This 
follows from the nature of the RCO 
bargaining equilibrium in Section III. 
Total surplus is maximized by both 
KMF’s and LMF’s in case A, LMF’s in 
case B, and KMP’s in case C. 

(b) Market equilibrium assigns residual 
claims to specialized inputs (those with 
positive setup costs) in cases A-C. This 
is not always compatible with surplus 
maximization. In cases B2 and C2, sur- 
plus maximization demands that hired 
inputs invest in specialized assets. How- 
ever, since hired inputs capture no 
quasi-rent ex post, such firms are nonvi- 
able. 

(c) Entry is optimal when neither input has 
an equilibrium rent. This implies s* — 
Y¥x —y_=9 so that net surplus cannot 
be increased through additional entry. If 
one input obtains a rent, gross surplus 
per firm (5 in the notation of Table 2) 
exceeds y, + y, in equilibrium. Hence, 
net surplus would be enhanced by fur- 
ther entry. Entry ceases prematurely be- 
cause the return on investments in spe- 
cialized assets has already fallen to zero 
for the input lacking an equilibrium rent. 


To see the implications for capitalist and 
labor-managed firms more clearly, suppose 
that capital is specialized (y, > 0) but labor 
is not (y, =0). In cases A-C of Table 2, 
the KMF is the equilibrium organizational 
form (case D is ambiguous). In cases A and 
C, the KMF also has desirable welfare 
properties: output maximizes surplus in each 
firm, entry decisions maximize surplus net 
of setup costs, and neither input receives a 
rent. However, in case B, output per firm is 
too large, entry is suboptimal, and workers 
capture some quasi-rent. From the stand- 
point of surplus maximization, the LMF is 
preferable in case B, but it does not arise 
because the suppliers of specific physical 
assets cannot capture any quasi-rent as hired 
inputs. 

By symmetry, identical remarks apply to 
the LMF when labor is specialized but phys- 
ical capital is not. In this situation, socially 
undesirable outcomes occur in case C of 
Table 2. Here the LMF prevails in equilib- 
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rium, but surplus would be larger under 
capitalist management. KMF’s are not vi- 
able because they fail to induce investment 
in specialized human capital. 

I close this section with a comment on an 
important special case of Assumption 7. 
Suppose the cost functions c, and c, have 
the quadratic form 


= Yq’. 


It is easy to check that B > y implies s,(p) 
>0>s,(p) for all positive prices. When 
B <y these inequalities are reversed. In the 
former situation case B of Table 2 applies, 
while in the latter situation case C applies. 
Since the ambiguous case D is excluded, 
residual claimant status always goes to sup- 
pliers of specialized inputs when costs are 
quadratic. 


V. Wealth Maximization and 
Organizational Form 


Section IV showed that KMF’s can pre- 
vail in equilibrium even when LMF’s yield a 
larger surplus. To this extent, the bargain- 
ing model vindicates proponents of the 
labor-managed firm, who argue that LMF’s 
may be desirable despite their current 
marginal status in market economies. How- 
ever, the comfort provided to LMF advo- 
cates is tempered by the fact that a symmet- 
ric argument applies to LMF’s themselves: 
some existing LMF’s might yield a larger 
total surplus if they were reorganized as 
capitalist firms. 

Under the bargaining mechanism exam- 
ined here, hired inputs generally queue up 
for employment when equilibrium organiza- 
tional forms fail to maximize total surplus. 
However, the fundamental difficulty is not 
the failure of input markets to clear. To see 
why, suppose case B2 of Table 2 applies, so 
that only the LMF is surplus-maximizing. 
Assuming capital is specialized (y, > 0) 
while labor is not (y,; = 0), the KMF pre- 
vails in equilibrium. Labor captures a rent 
of (1— a)s,, and workers queue for jobs. 


cx(@) = Ba? 
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If employers could require workers to bid 
competitively for jobs, they would be com- 
pensated ex ante for the rent (1— a)s, and 
the labor market would clear. This proce- 
dure would also address the problem of 
suboptimal entry, since capitalist investors 
now capture the full surplus s,. Even so, 
the labor-participation constraint remains 
nonbinding at the production stage, leading 
to excessive output and a surplus of s, 
rather than s*. 

The underlying difficulty can be formu- 
lated in either of two ways. One view is that 
the KMF does not maximize surplus be- 
cause output is noncontractible [by assump- 
tion (a) in Section I]. However, if one takes 
the noncontractibility of output as a given, 
then the problem boils down instead to one 
of noncontractible investment. Surplus 
would be maximized if the LMF displaced 
the KMF, but this does not occur because 
LMF’s do not attract investment in physical 
capital. Since setup costs are borne nonco- 
operatively, capital suppliers will not under- 
take specialized investments that bring them 
no return at the bargaining stage, even if 
such investments enable workers to appro- 
priate large positive returns. Problems asso- 
ciated with shared returns from noncooper- 
ative investment decisions are commonplace 
in the economics of organization (see e.g., 
Hart and Moore, 1990). Because hired in- 
puts cannot obtain positive returns when 
surplus is maximized, here one has a partic- 
ularly stark version of this phenomenon: if 
the hired input incurs any positive setup 
cost, firms maximizing iotal surplus cannot 
be viable. A discontinuity appears when this 
setup cost goes to zero, since then it is 
unnecessary for the hired input to appropri- 
ate any quasi-rent. Accordingly, the 
surplus-maximizing mode of organization 
becomes viable.'® 


This discontinuity in the market equilibrium does 
not result from the lumpy character of investments. 
Rather, it follows from the fact that hired inputs re- 
ceive zero quasi-rent when surplus is maximized (see 
footnote 14). 
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Whether or not proper investment incen- 
tives can be restored depends on the avail- 
able contracting options. One solution is to 
have LMF members compensate external 
capital suppliers for their sunk costs. Even 
if K and L both know the characteristics of 
the assets put in place by K, however, the 
courts might find it impossible to verify these 
characteristics. K’s promise to supply assets 
in exchange for a side payment is worthless 
if L has no guarantee of compliance. Such 
contracts may also be too costly or complex 
for the parties to formulate in the first place. 

Of course, once a capitalist firm has been 
established and specialized assets are al- 
ready in place, there is an incentive to reas- 
sign residual claims to labor if the LMF 
would yield a larger surplus. However, when 
capital obtains no quasi-rent as a hired in- 
put and investment is noncontractible, there 
is no incentive for the outside asset owners 
to maintain the firm’s capital stock once it 
has been reorganized as an LMF. The long- 
run survival of the LMF is accordingly un- 
dermined.'’ In practice, employee takeovers 
of capitalist firms frequently occur when the 
firm is already facing financial difficulty. 
Maintenance of the existing capital stock 
(whether by outsiders or workers them- 
selves) is seldom a central concern when a 
takeover involves a failing firm. 

An alternative scenario is for K to create 
any needed assets unilaterally and then sell 
LMF membership rights to workers. By pay- 
ing an entry fee to K, workers would obtain 
residual claims on the firms’s profit stream, 
the right to manage the firm, and the right 
to negotiate a fee for the use of K’s physica! 
assets. If there are no frictions in the 
market for membership and the LMF is 
surplus-maximizing, K will extract the sur- 
plus s*(p) ex ante and will voluntarily incur 


The analogous problem for capitalist firms, that of 
inducing workers to acquire specialized skills, is fre- 
quently solved by having employers bear training costs 
or through informal training on the job. Because skills 
can often be maintained simply by using them, incen- 
tives for the maintenance of such assets seldom pose a 
serious dilemma in the capitalist firm. 


the sunk cost y, whenever s*(p)> y,. En- 
try decisions are optimal, and output is cho- 
sen correctly ex post. Since workers receive 
a zero payoff (net of membership fees) in 
equilibrium, the labor market clears. Mar- 
kets for LMF membership thus have some 
appeal (Dow, 1986, 1992, 1993a). 

This approach removes the need for 
third-party verification, since now prospec- 
tive LMF members can directly inspect the 
firm’s assets before deciding whether to pay 
the membership fee demanded by K. How- 
ever, incentives for K to maintain or expand 
the firm’s capital stock remain problematic 
(unless workers periodically bid to renew 
their membership rights, a procedure which 
raises other difficulties if workers gain 
firm-specific skills after the first round of 
bidding). The LMF membership market is 
also vulnerable to informational asymme- 
tries between K and L, since asset owners 
have incentives to exaggerate the actual 
value of their assets. Indeed, a similar valu- 
ation problem arises for employee buy-outs 
of existing capitalist firms (Ben-Ner and Jun, 
1990). 

Finally, LMF’s might combine the roles 
of capital and labor supplier by treating 
physical assets as the collective property of 
the enterprise. Under this solution, the LMF 
no longer hires capital from outside owners, 
and the bargaining problem vanishes. The 
main objection concerns the investment in- 
centives of the LMF. Several authors have 
asserted that LMF’s with collective capital 
ownership will underinvest (Furubotn, 1976; 
Jensen and Meckling, 1979). In theory, this 
defect can be cured by making membership 
rights marketable, but the practicality of 
this proposal remains uncertain (Dow, 
1993a). 

Further progress in understanding why 
capital hires labor thus appears to require a 
better understanding of LMF membership 
markets and specifically the nature of the 
impediments to such markets. This in turn 
will demand a better integration of the bar- 
gaining perspective with standard adverse- 
selection and moral-hazard concepts. Work 
along these lines should help to explain the 
prevalence of capitalist firms and may sug- 
gest improvements in the design of their 
labor-managed counterparts. 
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Advance-Purchase Discounts and Monopoly Allocation 
of Capacity 


By Ian L. GALE AND THomas J. Ho_MEs* 


Optimal pricing by a monopoly airline that faces capacity constraints during the 
peak demand period is studied. The existence of capacity constraints means that 
in order to expand output the airline must divert demand from the peak period 
to the off-peak period. A particular advance-purchase discount policy is shown 
to be the profit-maximizing method of selling tickets. If the advance-purchase 
requirement were infeasible, output and total surplus would both be lower. 


(JEL 112) 


Substantial discounts for air travel are 
generally available to travelers who are will- 
ing to purchase tickets a week or more in 
advance of the departure date. This paper 
demonstrates why the imposition of an 
advance-purchase requirement may be a 
profit-maximizing strategy for a monopoly 
airline facing capacity constraints during 
times of peak demand.' The practice of 
imposing an advance-purchase requirement 
on discount tickets is also placed in a favor- 
able light in terms of its impact on social 
welfare. 

Two features of demand are important 
for the analysis that follows. First, there is 


*Department of Economics, University of Wiscon- 
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sity of British Columbia, Georgetown University, the 
University of Illinois, the University of North Carolina, 
the University of Wisconsin, the Antitrust Division of 
the Department of Justice, and the Winter 1990 meet- 
ings of the Econometric Society. 

The idea that restricted discount fares may serve to 
divert demand from the peak to the ofi-peak period 
has been suggested by Steven Salop (1978), Severin 
Borenstein (1983), and Elizabeth Bailey et al. (1985), 
all of whom observed that the availability of discounts 
is often quite limited during peak demand periods. 
However, this earlier literature does not explain why 
airlines employ advance-purchase discounts rather than 
offering a lower, uniform fare for off-peak flights, for 
example. 


demand uncertainty at the individual level. 
Consumers are not certain in advance of 
their desired departure time. For example, 
a business traveler may not know when a 
business meeting will end. Second, con- 
sumers vary in terms of the disutility or 
“time cost” that they suffer if they take a 
flight that leaves at the wrong time. For 
example, business travelers are thought to 
have a high value of time compared to 
tourists. 

The existence of capacity constraints dur- 
ing peak demand periods means that in 
order to expand output the monopolist must 
divert demand from the peak period to the 
off-peak period, using some form of dis- 
count as an inducement. One way to accom- 
plish this diversion is to offer a uniform 
discount for off-peak flights. Under this 
scheme, all individuals taking off-peak flights 
obtain a discount, including business travel- 
ers with a high value of time who turn out 
to prefer an off-peak departure time. An 
advance-purchase requirement for the 
off-peak discount keeps such high-time-cost 
individuals from obtaining the discount, en- 
abling the monopolist to extract greater 
rents from them. Because the advance- 
purchase requirement enables the monopo- 
list to target the discount more precisely, 
there is more of an incentive to expand 
output by diverting demand to the off- 
peak period. As a result, total output and 
total surplus are higher than they would 
be if the advance-purchase restriction were 
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infeasible. (The effect on consumer surplus 
is ambiguous, however.) 

We also endogenize the choice of capac- 
ity. Increases in peak capacity allow the 
airline to attain a given level of aggregate 
output while diverting fewer consumers to 
the off-peak period. This enhances the firm’s 
ability to extract informational rents from 
consumers, sO a monopoly airline is in- 
clined to set capacity at higher levels than 
a welfare-maximizing social planner would. 
However, if the advance-purchase require- 
ment were infeasible, the monopolist would 
choose a level of capacity that is even more 
excessive from a social-welfare standpoint. 

In many models of imperfect price dis- 
crimination a “bad” is bundled with the 
good in order to facilitate surplus extraction 
by a monopoly seller. For example, a low 
price may be available only if the consumer 
is willing to spend time in a queue or travel 
to a distant store (see Raymond Chiang and 
Chester Spatt, 1982). Such schemes can be 
effective when a consumer’s reservation 
value for the product is positively correlated 
with her value of time. Moving from uni- 
form pricing to price discrimination may be 
welfare-enhancing in such cases, but the 
aggregate time cost borne by consumers 
represents a deadweight loss to society. In 
the current paper, the advance-purchase re- 
quirement serves to divert demand in the 
face of a capacity shortage. The diversion of 
consumers to the off-peak flight facilitates 
the expansion of output, and the socially 
optimal way to accomplish this expansion is 
to divert individuals with low values of time. 
As a consequence, the aggregate time cost 
does not represent a deadweight loss to 
society here. 

The basic demand and supply conditions 
are described in Section I. In Section II we 
formulate the seller’s problem as a mecha- 
nism design problem and solve for the opti- 
mal direct-revelation mechanism. We then 
show that the optimal mechanism is imple- 
mented by a pricing policy that offers an 
advance-purchase discount fare for the 
off-peak flight. Section III examines ad- 
vance-purchase policies in more detail and 
shows why they dominate policies in which 
uniform discounts are given for the off-peak 
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flight. Endogenous determination of capac- 
ity is discussed in Section IV. 


I. The Model 


Two flights operate each day on a particu- 
lar route. One departs at a time of high 
demand, the other at a time of low demand. 
There is a continuum of risk-neutral con- 
sumers who each demand one ticket, and 
the total measure of consumers is equal to 
1. Between t= 0 and t=1, a measure a€ 
(5,1) of consumers learn that they will pre- 
fer the “high-demand” flight. The remain- 
der learn that they will prefer the “low- 
demand” flight. (We also refer to the flights 
as the “peak” and the “off-peak.”) Since 
there is a continuum of consumers, there is 
no aggregate demand uncertainty. Each 
flight has capacity K, where 
Since K <a, there is not enough capacity to 
serve all consumers who prefer the high- 
demand flight. However, total capacity is 
sufficient to serve all consumers. We nor- 
malize marginal costs to zero and set fixed 
costs to zero as well. 

Each consumer has a reservation value 
r > 0 for her preferred flight, but consumers 
differ in terms of the value of their time. A 
“type-y” consumer suffers a time cost y € 
[0, ¥] if she flies at her less-preferred time, 
which gives her a reservation value of r — y 
for the less-preferred flight. The distribu- 
tion of consumer types is the same for those 
who prefer the peak as for those who prefer 
the off-peak. The density of types, f(-), is 
strictly positive on the support [0,7]. The 
cumulative distribution is the continuous 
function F(-). 


II. The Optimal Monopoly Policy 


There are many ways that an airplane 
could sell tickets for its flights. One strategy 
would be to post prices, which could de- 


*Preferences for the peak flight could also be posi- 
tively correlated with value of time [i.e., a’(y)> 0], 
which is plausible since business travelers may be more 
likely to prefer the peak departure time than are 
tourists. The results are qualitatively similar in this 
case. 
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pend on the time of the flight and the time 
of purchase. The firm could also sell tickets 
through some form of auction. Given the 
wide array of possible strategies, a complete 
analysis of this problem would appear to be 
a daunting task. However, the literature on 
monopoly pricing under incomplete infor- 
mation has made great progress in dealing 
with problems of this type (see Milton Har- 
ris and Artur Raviv, 1981; Roger Myerson, 
1981; Eric Maskin and John Riley, 1984). 
The papers in this literature consider a 
monopolist’s choice of a “direct-revelation 
mechanism” in which consumers report their 
private information to the seller. All subse- 
quent allocation decisions and payments de- 
pend on these reports. The major benefit of 
this approach is its generality: all members 
of a broad class of pricing mechanisms, in- 
cluding posted-price schemes and auctions, 
are equivalent to some direct-revelation 
mechanism.* 

This section formulates the mechanism- 
design problem and solves for the optimal 
direct-revelation mechanism. It is shown 
that the optimal mechanism is characterized 
by the “cutoff” y™. All individuals with 
time costs below y™ depart on the low- 
demand flight. Those individuals with time 
costs above y™ who prefer the low-demand 
flight take that flight. Those individuals with 
time costs above y™ who prefer the high- 
demand flight take that flight or do not fly. 
A particular posted-price policy, which of- 
fers an advance-purchase discount on the 
low-demand flight, is then shown to be 
equivalent to the optimal direct-revelation 
mechanism. 


A. The Optimal Direct-Revelation 
Mechanism 


All contracting takes place at t = 0. The 
mechanism that the seller announces must 
induce consumers to report their private 
information truthfully. In particular, they 


>The “revelation principle” ensures that attention 
can be restricted to direct-revelation mechanisms in 
which it is a Bayes-Nash equilibrium to report one’s 
type truthfully. See Myerson (1981) or Drew Fuden- 
berg and Jean Tirole (1991) for more details. 
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must find it optimal to divulge their time 
cost at ¢=0 and their flight preference at 
t= 1. It turns out that the latter constraint 
is not binding at the optimum. We first 
examine a relaxed problem in which the 
flight preference of each consumer is ob- 
servable at t=1. We subsequently show 
that the solution to the relaxed problem 
satisfies the truth-telling constraint at t = 1, 
so a solution to the relaxed problem is a 
solution to the original problem. 

The mechanism specifies the probability 
that an individual departs on a given flight 
as a function of her reported time cost and 
her observed flight preference. Let 0/(y) be 
the probability that an individual departs on 
flight 7 when she reports a time cost y and 
is observed to prefer flight i. From the 
consumer's perspective, one need only focus 
on two derived probabilities. The first is the 
ex ante probability that the consumer de- 
parts on the wrong flight: 


(1) w(y)=(1-a@)6i(y) + 


The first term on the right-hand side of 
(1) gives the probability that she prefers 
the low-demand flight but ends up taking 
the high-demand flight. [Recall that 1—a 
is the ex ante probability of preferring the 
low-demand flight, while 6};(y) is the prob- 
ability of taking the high-demand flight, 
conditional on preferring the low-demand 
flight.|] The second term covers the opposite 
case. The other derived probability is the 
probability of departing on one flight or the 
other: 


(2) d(y) 


+ + 


A consumer’s net surplus also depends on 
the expected payment made to the seller. 
Let z(y) be. the expected payment from a 
consumer who reports a time cost of y.* 


*Here, w(y) could be a certain payment that is 
collected at t = 0, or it could be the expected value of a 
random payment based on contingencies at t=1. In 
the mechanism discussed in Subsection II-B, only those 
consumers who fly will actually pay. 
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The expected net surplus of an individual 
whose true type is y’, but who reports y, is 


(3) V(y,y') 


Incentive compatibility (IC) requires that 
it be optimal to report one’s time cost truth- 
fully at t =0: 


y’ argmaxV(y,y’). 
ye[0,y] 


Truthful revelation at t=0 gives an ex- 
pected net surplus of v(y)=V(y, y). The 
first lemma states that IC can be expressed 
in terms of two conditions on w(-) and v(-). 
In particular, individuals with higher time 
costs are no more likely to fly at the wrong 
time than are consumers with lower time 
costs. (For the method of proof see Myer- 
son [1981] or Fudenberg and Tirole [1991].) 


LEMMA 1: IC holds if and only if w(-) is 
nonincreasing and v(-) satisfies the following 
condition: 


(4) v(y) = f’w(u) du v(5). 


Consumers enter into a contract only if 
their expected return from doing so is non- 
negative. This is the voluntary participation 
(VP) constraint: 


v(y)>0 forall y <[0,¥]. 


Since w(-)>0, if v(¥)>0 then VP holds 
whenever (4) is satisfied. The monopolist 
will extract all surplus from the highest type, 
so v(y)=0 in an optimal mechanism. 
We henceforth assume that (4) holds and 
that v(y)=0. When w(-) is nonincreasing, 
IC and VP will then be satisfied. 

Using (3) and (4), and the fact that 
v(y)=0, the expected payment from a 
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type-y consumer can be written as 
(5) ™(y)=d(y)r—w(y)y 


f'w(u) du. 


Together, the first two terms represent the 
net social surplus, while the third term is 
the net private surplus that accrues to the 
type-y consumer. Substituting in (5) and in- 
tegrating by parts gives the total revenue for 
the seller: 


(6) 
= F(y))) dy. 


Finally, define the level of output on flight 
j by 


(j=L,H). 


The firm’s problem can now be stated 
formally as expression (8), below, subject to: 


(C1) technical constraints: ai(y) > 0 for all i 
and j and y; 6),(y) <1 for all 
i and y; 

(C2) monotonicity: w(-) is nonincreasing; 

(C3) resource constraints: Q; < K(j = L,H). 


A regularity condition on the distribution 
of types simplifies the analysis that follows. 
First, define the function p(-) by 


F(y) 


The regularity condition is the following: 


(10) 


p(-) is monotone decreasing. 


(8) max 


—w(y)Ly-f(y)+ F(y)])) dy 


#3237 
i 
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It will be shown in Section III that (10) 
ultimately requires that marginal revenue, 
properly defined, be monotonic. 

The cutoff y™ is now defined in terms of 
two critical time costs. First, let y° be the 
unique root of p(y)=0. [If p(¥)> 0, then 
y” = ¥.] Second, let y* be the unique cutoff 
that satisfies 


(11) a[1—F(y*)| =K. 


There will be a measure K of consumers 
who prefer the high-demand flight and who 
have a time cost above y*. The optimal 
mechanism is characterized by the smaller 
of these two values: y™ = min{y®, y*}. The 
result is now stated. 


LEMMA 2: Assume that the regularity con- 
dition (10) is satisfied. The following proba- 
bility functions solve the seller’s problem: 


(12a) 6!(y)=1 and forally 


(12b) and of(y)=0 fory<y™ 


(12c) and 


7 


H; y\ — 
~ FO™)I 
The proof is in the Appendix. 

When capacity is large, few consumers 
need to be diverted to the low-demand flight 
in order to have a total output of Q = 1, so 
y< <y®. In this case the optimal cutoff is 
y™ = y¥, and there are just enough seats on 
the peak flight for all types y > y™. That is, 
for all y > y™. When capacity is 
small, > y°, which means that y™ < 
Demand exceeds supply in this case, and 
seats on the peak flight are rationed. (The 
allocation among these consumers does not 
affect anyone’s expected payoff. The opti- 
mal mechanism described above uses ran- 
dom rationing. There exist other solutions, 
all of which are payoff-equivalent for all 
individuals.) 

A consumer with a time cost y > y™ flies 
with an ex ante probability d(y)=(1—- a@)+ 


fory>y™. 
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a6\i(y). The probability of flying at the 
wrong time is w(y)=0 for all types y > y™, 
so these consumers get zero expected sur- 
plus. The expected payment for a consumer 
with a time cost y > y™ is therefore 


(13) 7(y)=[(1- + 


Consumers with y < y™ all take the off-peak 
flight with probability 1. The expected pay- 
ment for a consumer of type y<y™ is 
therefore 


(14) 


=r—-—ay™. 


The optimal direct-revelation mechanism 
for the relaxed problem in which flight pref- 
erences are publicly observable at t = 1 has 
been characterized. Individuals would have 
no incentive to misrepresent their flight 
preferences in the optimal mechanism (see 
footnote 5 for further discussion). Hence, 
the solution above also solves the original 
problem in which the truth-telling con- 
straint at = 1 is imposed. 


B. The Optimality of Advance-Purchase 
Discounts 


We now describe a posted-price policy 
which is equivalent to the optimal direct- 
revelation mechanism described above. The 
firm sets a “coach” fare p.~=r, which is 
paid by all consumers who purchase at t = 1 
and by anyone who purchases a ticket at 
t = 0 for the peak flight. The firm also sets a 
discount fare pp=r—ay™ for tickets on 
the off-peak flight that are purchased at 
t=0. The firm limits the refundability of 
the discount tickets. Specifically, the refund 
is less than r—y™. Finally, it is assumed 
that resale of tickets is impossible. 

A consumer facing this fare structure 
would never purchase a ticket for the peak 
flight at t = 0, since she has probability 1 — @ 
of preferring the off-peak flight. Either she 
will wait until t = 1 to purchase, and receive 
zero surplus, or she will purchase a ticket 
for the low-demand flight at t = 0. Since the 
latter strategy imposes an expected time 


= 
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cost of ay on a type-y consumer, her ex- 
pected net surplus from purchasing in ad- 
vance is 


(15) 


All consumers with y < y™ will purchase a 
seat on the low-demand flight at t = 0. Any- 
one who turns out to prefer the high- 
demand flight is better off taking the low- 
demand flight rather than seeking the 
partial refund. All consumers with a time 
cost above y™ will wait until ¢=1 to pur- 
chase and will attempt to purchase a ticket 
for their preferred flight. The expected pay- 
ments of all consumers are identical to their 
payments in the optimal direct-revelation 
mechanism, and the allocation is the same.” 
We conclude with the following proposition. 


r—Pp-ay=a(y™-y). 


PROPOSITION 1: Assume that the regular- 
ity condition (10) is satisfied. The optimal 
selling mechanism is implemented by a policy 
in which (i) the price of a ticket purchased at 
t=0 for the off-peak flight is r—ay™, and 
(ii) the price of all other tickets is r. 


Ill. Analysis of the Optimal Policy 


This section provides additional intuition 
underlying the results of Section II. Con- 
sider the class of advance-purchase discount 
(APD) policies that specify a price pp <r 
for a ticket on the off-peak flight that is 
purchased at t=0, and a price pc =r for 
all other tickets. We restrict attention to 
APD policies here since the results of Sec- 
tion II show that a particular APD policy 
implements the optimal mechanism. 

Suppose that the discount fare is equal to 
Pp, and let y be the time cost of the 
marginal consumer who purchases in ad- 
vance. It follows from (15) that p, =r —ay/. 
(We can also restrict attention to policies in 


“It is now clear that no consumer has an incentive to 
misrepresent her flight preference in the optimal 
direct-revelation mechanism. Individuals with y > y™ 
have a clear incentive to report their flight preference 
truthfully, since otherwise they might end up on the 
wrong flight. Individuals with y < y™ take the low- 
demand flight, whatever report they make about their 
flight preference, so they have no incentive to lie. 
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which # < y*.) It is convenient here to view 
the monopolist as choosing the cutoff y. 
Profit can then be written as 


(16) M(%)=r{K +(1-a@)[1- F($)]} 
The two terms account for revenues from 


the sale of coach tickets and discount tick- 
ets, respectively. For § < y*, the slope is 


f(¥) 


(17) II'(9)=af(¥) 


Note that II’(0)>0. As § increases, aggre- 
gate output expands, but the price of a 
discount ticket falls. When 0, the dis- 
count price is Pp, =r, so there is no first- 
order loss in revenue from existing pur- 
chasers when the discount fare is lowered. 
However, there is a first-order increase in 
sales, so the profit-maximizing level of 
advance-purchase sales is strictly greater 
than zero. 

When the cutoff satisfies $< y*, total 
output is 


(18) 


which is strictly increasing in § for ~ < y*. 
The cutoff can now be expressed as 9(Q), 
and §(Q)=[a: Taken 
gether, equations (17) and (18) indicate that 
the marginal revenue from increasing out- 
put is 


all 


= p(¥(Q)) 


for Q<1. Hence, the regularity condition 
requires that marginal revenue be mono- 
tone decreasing.® The optimum can now be 


F(9(Q)) 


°When marginal revenue and p(-) are not mono- 
tonic, lotteries are used to flatten out p(-). It is now 
possible that the monopolist will offer two advance- 
purchase prices, with quantity rationing at the lower 
price. For a related result see Jeremy Bulow and John 
Roberts (1989), for example. 
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interpreted as follows: output is expanded 
by selling off-peak advance-purchase dis- 
count tickets until marginal revenue is zero 
or until the boundary is reached and all 
consumers are buying. 

The method of diverting demand from 
peak periods to off-peak periods that has 
been most commonly studied in the litera- 
ture is explicit peak-load pricing. A uniform 
discount is given to all consumers who take 
the off-peak flight. We will refer to such a 
price structure as a uniform, off-peak dis- 
count (UOP) policy. If the capacity on the 
two flights were controlled by a continuum 
of “atomless” firms rather than a single 
monopolist, then the competitive equilib- 
rium price structure would entail a uniform, 
off-peak discount. In the remainder of this 
section we explain why an advance-purchase 
discount policy is superior to a uniform, 
off-peak discount policy for a profit-maxi- 
mizing monopolist. 

Suppose the seller were to offer a UOP 
policy in which the prices of the peak and 
off-peak flights were, respectively, r and 
r—y™.’ As in the optimal mechanism, de- 
mand for off-peak seats consists of (i) con- 
sumers who prefer the off-peak flight, and 
(ii) consumers who prefer the peak but who 
have time costs below y™. Letting II"(-) 
denote profit as a function of the cutoff 
when a UOP policy is employed, 


(20) T"(y™) 


which is lower than the profit from the APD 
policy. 

It is instructive to examine the sources of 
this difference in profits. The revenue from 
the peak flight is rK in both cases. How- 
ever, the revenues from the off-peak flight 
differ for two reasons. First, in the APD 
regime the discount only goes to the con- 
sumers who purchase at ¢ = 0. With a uni- 
form, off-peak discount, the discount goes 
to all passengers who take the off-peak 


"It can be shown that, when the monopolist is 
constrained to offer a UOP price structure, he will set 
price equal to r for the peak flight. 
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flight. Second, the advance-purchase price 
in the APD regime is r—a-y™, which is 
strictly greater than r—y™. The marginal 
consumer who purchases in advance in the 
APD regime must be compensated for the 
expected disutility associated with taking the 
off-peak flight. If consumers are induced to 
switch flights after they know their pre- 
ferred flight, as in the UOP regime, then 
the actual disutility of the marginal con- 
sumer who switches determines the dis- 
count. Hence, for a given level of output, 
the size of the discount and the measure of 
consumers who receive it are both smaller 
under the APD policy.* 

The differences between these two re- 
gimes are also germane when one studies 
the effect of banning advance-purchase 
sales. Suppose that the seller were re- 
stricted to using policies with a uniform, 
off-peak discount. When the peak price is 
equal to r and the off-peak price is equal to 
r — y, the change in profit as 9 varies is 


_ 
£0) 


~(1-a). 


For the same cutoff 9, the increase in profit 
as the cutoff is increased is lower here than 
when an APD policy is used. The profit- 
maximizing cutoffs from the optimal mecha- 
nism (y™) and from the UOP regime (y") 
are graphed in Figure 1, as functions of 
capacity. Output is Q=1, and the cutoff 
equals y* when capacity exceeds K™ and 
K” in the respective regimes. 

The incentive to increase the cutoff is 
greater in the APD regime for the very 
reasons given above. Under both an APD 
policy and a UOP policy, increasing the 
cutoff by Ay causes output to rise by 
af(y)Ay. The increase in the discount that 
is required to attain this rise in output is 


~The optimal mechanism can also be implemented 
by a policy with uniform prices r — y™ for the off-peak 
flight and r for the peak, and an “entry fee” of 
(1—a)y™. The entry fee is paid by all individuals, 
including those who ultimately do not fly, before flight 
preferences are known. 
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Ficure 1. PRoFit-MAxIMIzING CuTorFs As 
FUNCTIONS OF CAPACITY 


only aAy in the APD regime, compared to 
Aj in the UOP regime. This increase in the 
discount also goes to fewer recipients in the 
APD case. Marginal revenue is lower when 
the seller is restricted to using policies with 
a uniform off-peak discount, so output and 
total surplus are both lower. The effect on 
consumer welfare is ambiguous: in some 
cases all consumers are strictly worse off 
(ex ante) with APD; in other cases there is a 
Pareto improvement with APD.’ 


IV. Endogenous Determination of Capacity 


The analysis so far has taken the level of 
capacity as given. We now endogenize ca- 
pacity. Suppose that K units of capacity 
cost c(K), where c(-) is strictly increasing 
and convex, and c’($) < r. Suppose also that 
the regularity condition (10) is satisfied. 
There are two main results. First, the level 


"lf K>K® in Figure 1, then the optimal cutoff 
equals y in both regimes, and total surplus is the 
same in the two regimes. Since the monopoly profit is 
strictly higher with APD, consumer surplus must then 
be lower. Consumers of type y < y* are worse off in 
the APD regime because the discount for the off-peak 
flight in the APD regime is smaller than the off-peak 
discount in the UOP regime. Consumers of type y > y¥ 
are worse off since they receive no surplus in the APD 
regime whereas in the UOP regime there is a positive 
probability that they will prefer the off-peak flight and 
receive a discount. For other parameter values, the 
APD regime is Pareto-superior to the UOP regime. 
When r is small, for example, the optimal policy in the 
UOP regime is to set the off-peak price equal to r, so 
consumer surplus is zero for all individuals. 
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of capacity chosen by a monopolist is at 
least as great as the level chosen by a social 
planner. Second, total surplus is higher, and 
capacity is lower, for a monopolist who is 
able to offer advance-purchase discounts 
compared to one who can only offer a uni- 
form, off-peak discount. 

As our benchmark, we consider a social 
planner who selects capacity and the alloca- 
tion of seats to maximize total surplus. (The 
results will be qualitatively the same if the 
planner can only select capacity and the 
monopolist then selects prices.) When ca- 
pacity is already in place, and the cutoff is 
¥, total surplus is 


(22) 
af 


The first term represents the gross surplus, 
while the second represents the aggregate 
time cost. ‘he planner will divert all con- 
sumers with time costs y < y*(K)= 
min{r, y*} from the peak flight to the off- 
peak flight. When r < y*, some consumers 
with time costs y > r who prefer the peak 
flight will not fly. When r > y*, sufficiently 
many consumers are induced to switch that 
Q=1. 

Now consider the choice of capacity. The 
maximized value of total surplus, as a func- 
tion of the level of capacity chosen, can be 
written as 


(23) TS*(K)=[K+(1-a)+aF(y*(K))]r 
af ~¢(K). 
0 


Since y*(K) = min{r, y*}, the slope is 


(24) TS*(K) =min{r, y5}—c'(K). 


Note that TS*”’(K) <0 for all K €(4,a@). 
Now consider a monopolist who will use 

the optimal APD policy. Profit, as a func- 

tion of the level of capacity chosen, can be 


= 
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written as 


(25) Il(K)=r{K +(1-a)[1- F(y™)]} 


+(r—ay™)F(y™)—c(K). 


Straightforward calculations indicate that 


the slope is 
(26) Il’( K) = min ofly*) 


Since the monopolist values capacity at least 
as highly at the margin, the level of capacity 
chosen by the monopolist will be at least as 
great as the level chosen by the social plan- 
ner. (Note, however, that the monopolist 
does not have excess capacity on the peak 
flight. The level of capacity chosen under 
APD is strictly below a, since there is no 
first-order loss in revenue from increasing 
the cutoff above zero, and the optimal cut- 
off is never larger than y*.) 

When r is relatively small, the level of 
capacity chosen satisfies r=c’(K) for the 
planner and the monopolist. In both cases, 
aggregate output is Q <1, so increasing ca- 
pacity simply leads to an increase in sales 
for the peak flight. As r rises, the capacities 
increase, until r = y“ = c’(K). [The value of 
K at which y* =c’(K) is clearly unique.] 
Beyond this point, the monopolist’s capacity 
continues to rise, while the planner’s capac- 
ity remains constant. Eventually, a point is 
reached at which the monopolist’s capacity 
itself reaches a plateau. In the region where 
the planner and the monopolist choose the 
same capacity, output is strictly greater for 
the planner. In the region where the plan- 
ner’s capacity is strictly lower, the planner’s 
output (Q = 1) is at least as great. 

The idea that a monopolist would actu- 
ally invest in more capacity than would a 
social planner may seem counterintuitive at 
first. An explanation for this result is given 
for the case of large r. When K is large, 
output is Q = 1 for the monopolist and the 
planner both. Suppose, then, that r is suf- 
ficiently large that we are in that region. 
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The benefit to a social planner of increasing 
capacity at the margin by AK is a reduction 
of AK-y™ in the aggregate time cost, since 
AK fewer passengers need to be diverted to 
the off-peak flight. Under the APD scheme 
as well, AK fewer passengers need to be 
diverted, so (AK/a) fewer advance- 
purchase seats will be sold. These (AK /a) 
passengers no longer receive the discount 
ay, so this represents a gain to the seller 
of AK-y*. However, there is an additional 
gain, since the size of the discount will drop 
as the time cost of the marginal consumer 
drops. As a consequence, the incentive to 
expand capacity is greater for the 
monopolist. 

Now return to the case of a monopolist 
who is restricted to offering policies with a 
uniform discount for the off-peak flight. The 
capacity chosen here will be at least as large 
as when advance-purchase discounts are 
permitted. To see this, note that profit in 
the UOP regime is of the form: 


(27) 


—c(K) 


so the slope is 


(28) +—_ 
af(y*) 


—c'(K). 


The increase in profit from adding capacity 
at the margin is at least as great in the VOP 
regime as in the APD regime. 

From the standpoint of total surplus max- 
imization, the level of capacity chosen in the 
UOP regime is even more excessive than in 
the APD regime. The reason for this is 
clear. There are two ways for a monopolist 
to expand output. One is to divert more 
consumers to the off-peak flight; the other 
is to add more capacity. For a given K, it is 
relatively less costly to expand output by 
diverting consumers to the off-peak flight 
when advance-purchase discounts are used, 
so there is less of an incentive to expand 
capacity in the APD regime. 


= 
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While capacity is lower in the APD 
regime, total surplus is strictly greater. There 
are two cases to consider, depending on 
whether r is small or large. When r is 
small, capacity is the same in the two 
regimes, and there is excess demand for 
seats on the peak flight. Following the anal- 
ysis of Section III, output and total surplus 
are both strictly greater in the APD regime 
when r is small. As r increases, the APD 
policy reaches an output of Q=1 first. 
(Hence, output is always at least as large in 
the APD regime as in the UOP regime.) Let 
K* and K" be the capacities chosen in the 
two regimes when r is large. Total surplus 
in the APD regime is equal to TS*(K*), 
since the level of output (Q = 1) is socially 
optimal given the capacity chosen. Total 
surplus in the UOP regime is no larger than 
TS*(K"). The fact that TS*(-) is_strictly 
concave and the fact that K*<K*<K" 
[where K* maximizes TS*(K )] ensures that 
total surplus is strictly greater in the APD 
regime than in the UOP regime. 


V. Concluding Comments 


Suppose that it is uncertain initially which 
flight will be the high-demand flight. The 
optimal direct-revelation mechanism re- 
mains the same as in Proposition 1, but 
implementation via advance-purchase dis- 
counts is slightly different. Now, the flight 
time for which the advance-purchase ticket 
is valid is not announced until after the 
demand uncertainty is resolved. This type of 
policy, whereby a consumer pays in advance 
and is subsequently served at a time chosen 
by the seller, is seen in the Australian air- 
line industry, for example.'® Other situa- 
tions in which consumers accept a lower 
probability of being served at their pre- 
ferred time involve third-class mail and 
some electric-power contracts (see Robert 
Wilson, 1989). 

The model still predicts that no seats on 
the peak flight are sold at a discount. In 
reality, airlines do make some discount seats 
available on peak flights. In another paper 


We thank Jeff Borland for this example. 
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(Gale and Holmes, 1992), we argue that if 
the timing of peak demand is uncertain, and 
if offering contingent tickets of the type just 
described is infeasible, then the firm will 
sell advance-purchase discount tickets for 
both flights. Advance-purchase discounts 
smooth out demand fluctuations in that set- 
ting. 

In addition to the advance-purchase re- 
quirement for discount tickets, airlines usu- 
ally require that consumers purchase a 
round-trip ticket that includes a Saturday- 
night stayover. Suppose that some _ high- 
value-of-time individuals (i.e., business trav- 
elers) were certain in advance that their 
preferred flight was the off-peak. An 
advance-purchase requirement by itself 
would not exclude such individuals from 
receiving the discount. Suppose, however, 
that an observable characteristic, such as an 
unwillingness to stay over a Saturday night, 
were positively correlated with value of time. 
The firm might wish to impose further re- 
strictions in order to exclude such individu- 
als from receiving a discount. 

The main empirical prediction of this pa- 
per is that airlines will limit the availability 
of discount seats on peak flights. There is 
anecdotal evidence that airlines do this, but 
more careful study is needed."! Ideally, such 
a study would employ a flight-level data set 
that specifies, for a given date and time, the 
fares paid by all of the passengers on that 
flight. Unfortunately, this is greater detail 
than is available in the airline data sets 
currently in use. (An alternative strategy 
would be to collect data on the availability 
of discount seats by using the on-line com- 
puter reservation systems now available.) 

A finding that discount seats are in lim- 
ited supply during peak times is certainly 
consistent with the hypothesis that they 
serve a demand-diversion role. However, if 
high-reservation-value consumers are more 
likely to prefer to fly during times of peak 
demand, then the availability of discounts 


"Airlines sometimes have stated explicitly that cer- 
tain advance-purchase discounts are available only on 
off-peak flights (“Eastern to Offer Discounts on Off- 
Peak Flights in East” [Wall Street Journal, 9 August 
1987, p. 10)). 
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will be limited during peak times, even with- 
out capacity constraints. The validity of the 
demand-diversion hypothesis would require 
that the flights with limited availability of 
discount seats also be the flights that tend 
to be up against capacity constraints. 


APPENDIX 


PROOF OF LEMMA 2: 

First, substitute equations (1) and (2) for 
w(:) and d(-) into the objective function 
(8). If we ignore the monotonicity constraint 
(C2) and the resource constraint (C3) on the 
low-demand flight, we obtain the (continu- 
ous) linear program (LP) in expression (A1), 
below, subject to 


(A2) <1 forall y 


(A3) Of (y)+0t(y) <1 forall y 


(A4) adll(y)]dy < K 


(A5) forall i,j, and y. 


Let A'(y) and AM(y) be the multipliers 
on constraints (A2) and (A3), and let uw be 
the multiplier on the resource constraint 
(A4). The dual of the linear program (LP*) 
is 


(A6) min f +A"(y)] dy + Ku 
AL(-), AH(-), 9 
subject to 


(A7) 
(A8) 


AY(y)+ 

= f(y)U— a@)p(y) 
AN(y) = f(y) ae(y) 
AN(y) + f(y)ar 
Mi(y)>0 AM(y)>0 


(A9) 
(A10) 


(All) for all y 
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We claim that a solution to the primal 
problem (LP) is given by (12a)—(12c) and 
that a solution to the dual problem (LP*) is 
given by 


(Al2) forall y 
(A13) A"(y)=f(y)ap(y) 
A"(y) = f(y)ap(y™) 


for y<y™ 
for y>y™ 
(Al4) w=r—p(y”). 

The proposed solution (12a)—(12c) to LP is 


clearly feasible. The proposed solution to 
LP* is also feasible since p(y)>0 for y < 


y™. It is easily verified that the complemen- 


tary slackness conditions hold. Therefore, 
(12a)—(12c) solve LP. Since this solution sat- 
isfies the monotonicity constraint (C2) and 
the resource constraint (C3) on the low- 
demand flight, it solves the maximization 
problem (8). 
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Auctions With Endogenous Valuations: 
The Persistence of Monopoly Revisited 


By KrIisHNA* 


It is shown that the standard arguments for a monopoly to persist break down 
when many units of capacity become available sequentially. The reason is that 
deterring entry at one stage affects the cost of doing so in later stages. This force 
can be so powerful that entry is deterred only at the final stage. The persistence 
of monopoly or its erosion over time depends on the timing of arrival of 
capacity, the size of early versus subsequent capacity availability, the length of 
the capacity-acquisition stage relative to the life of the industry, and the discount 
factor. (JEL C72, D42, D44, F13, L12) 


When additional scarce resources become 
available over time, will these be acquired 
by a large incumbent firm in order to per- 
petuate its market power, or will these be 
acquired by new entrants to dissipate that 
power? If the additional resources become 
available as a single lump, a standard argu- 
ment (Richard J. Gilbert and David M. G. 
Newbery, 1982) suggests that these re- 
sources have more value to the incumbent 
than to a new entrant. Thus the incumbent 
would outbid the entrant, and monopoly 
would persist. In this paper, I argue that if 
the additional resources become available 
piecemeal rather than as a whole, this rea- 
soning needs to be amended. I study this 
problem by formulating it as an auction in 
which successive units of industry-specific 
capacity are bid for by the firms in the 
industry. 

The model developed here also applies to 
markets where output is limited by direct 
quantity controls, such as licenses to oper- 
ate or sell, on both the entry of new firms 
and the expansion of existing firms. In many 


*Fletcher School of Law and Diplomacy, Mugar 
250, Tufts University, Medford, MA 02155. I am grate- 
ful to Vijay Krishna for all his help, to Richard J. 
Gilbert for directing me to the paper by Tracy R. 
Lewis, to a referee for many useful comments, to the 
National Science Foundation for research support un- 
der Grant No. SE8822204, and to the Institute for 
International Economics in Stockholm for their hospi- 
tality in June 1989 when work on this paper was begun. 
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cities, taxicab operation is limited to those 
who hold medallions. More medallions are 
created and allocated occasionally. Simi- 
larly, takeoff and landing slots limit the 
number of flights at major airports. New 
slots are created by adding to airport capac- 
ity by building new runways or terminals 
and are valued differently by different firms. 
Quotas limit the level of imports directly, 
and import licenses have been used in a 
number of countries to implement quotas. 
Licenses have been allocated directly or 
sold, as in Colombia and New Zealand. 
Quota license auctions have also been pro- 
posed in the United States as a way of 
raising revenue (see C. Fred Bergsten et al., 
1987; Krishna, 1990a).! 

The examples given above share some 
common features. First, the value that a 
firm, incumbent or entrant, attaches to an 
additional unit of the resource depends on 
the distribution of the remaining units across 
firms. For example, how a monopolist with 
100 units of existing capacity will value the 
tenth of an additional ten units depends on 
whether the first nine units were acquired 
by the monopolist, acquired by a single ri- 
val, or evenly distributed across many firms. 
Second, since the units are made available 
sequentially, a firm’s willingness to pay for 


‘Robert C. Feenstra et al. (1990) look at the impli- 
cations of standard auction theory for quota license 
auctions. 


earlier units depends on how it anticipates 
later units to be allocated. This, of course, is 
derived from the values placed on the final 
distributions of capacity. I adopt the term 
endogenous valuations to refer to the first of 
these aspects, namely, that since the values 
are determined endogenously from the out- 
come of market interaction they depend on 
the allocation of all units.” 

Endogenous valuations occur naturally 
when profits are interrelated either through 
costs or through prices. Price interrelations 
are to be expected with imperfect competi- 
tion, as argued above. Cost interrelations 
can also lead to similar effects since the 
costs of production can depend on the allo- 
cation of the scarce resource. For example, 
consider the value of the right to drill a well 
to access a common pool of oil. The value 
of an additional drilling right depends on 
the overall allocation of such rights. This is 
because the pressure of oil in the common 
pool, and so the resulting cost of extraction, 
depends on the extraction levels of all par- 
ties. 

The valuations that result from such con- 
siderations take on forms that are different 
from those often postulated in auction the- 
ory. In the literature on multiple-unit auc- 
tions, the standard assumptions made are 
that the marginal valuations of successive 
units are constant and independent of the 
allocation of other units. Alternatively, it is 
assumed that each bidder has a downward- 
sloping demand curve.° Neither formulation 
seems appropriate when the valuations are 
determined as a result of market interac- 
tion. This, rather than whether the valua- 
tions are determined endogenously or speci- 
fied exogenously, is the key. In order to 
‘focus on these issues, in what follows, I 
assume that there is no uncertainty about 
the various aspects of the situation and that 
the agents are completely informed. Here I 


“Such effects have been noted before. For example, 
Michael L. Katz and Carl Shapiro (1986) recognize 
that valuations are endogenous in studying how to 
license technology. Morton I. Kamien and Israel Zang 
(1990) also recognize this in studying horizontal merg- 
ers. 

See Paul R. Milgrom (1985) for an excellent survey. 
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depart from a major concern of auction 
theory, namely, the role of incomplete in- 
formation. 

More closely related is some recent work 
by Ian Gale (1990a), who looks at the auc- 
tion of multiple objects with superadditive 
values. He shows that, under certain condi- 
tions, revenue is maximized when a single 
bidder acquires all the objects. Carolyn 
Pitchik (1989) studies sequential auctions 
with incomplete information, in which, al- 
though valuations are fixed, effective valua- 
tions of agents are endogenously deter- 
mined because of the presence of budget 
constraints. Effective valuations are like ef- 
fective demand in that they are valuations 
backed by purchasing power. 

The sequential-auction model is used to 
address the question of whether there are 
intrinsic forces at work that will lead to a 
concentration of scarce resources and en- 
sure that monopolization occurs. The stan- 
dard argument (Gilbert and Newbery, 1982) 
relies on the idea that, since competition 
destroys industry profits, deterring entry is 
more valuable to an incumbent than entry is 
to a potential entrant. While the basic idea 
is simple and powerful, there may be other 
factors that mitigate this effect. These in- 
clude: uncertainty about the technology of 
preemption or the entrant’s characteristics; 
superiority of the entrant’s costs or technol- 
ogy; and when the scarce resource is knowl- 
edge, the absence of well-defined property 
rights. If these effects are large, monopoly 
power may not persist.* 

However, none of these factors leads to 
an erosion of monopoly power in this paper. 
Rather, the model developed here points to 
another aspect of the problem, namely, that 
the threat of entry occurs at more than one 
point in time. The idea is that deterring 
entry at one stage raises the cost of doing so 
at future stages, and this force can be so 
powerful that entry is not deterred except at 
the final stage! 

In an insightful and closely related paper, 
Tracy R. Lewis (1983) makes a similar argu- 
ment. However, his analysis overestimates 


*See Jean Tirole (1989 pp. 350-2) for a detailed 
discussion. 
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the cost of preemption as it does not fully 
incorporate the effect of acquiring an addi- 
tional unit of capacity on the resulting value 
of future capacity. His work is discussed in 
more detail below, in Section III. 

David A. Malueg and Marius Schwartz 
(1991) look at the persistence-of-monopoly 
issue in a different light. They argue that 
monopoly may be vulnerable to entry be- 
cause the incumbent cannot always credibly 
undertake actions that mimic the outcome 
with entry. For instance, a monopolist who 
obtained an additional unit of capacity 
would not expand output by as much as an 
entrant would. This phenomenon also oc- 
curs in the model presented below. 

Also related are papers by Pankaj 
Ghemawat (1990), Gale (1990b), and 
Krishna (1990b). Ghemawat examines the 
value of a single unit of capacity to different 
sized firms that are engaged in capacity- 
constrained price competition. He finds that 
it is more valuable to the larger firm. Al- 
though he concludes that this will result in a 
“snowball effect” whereby the larger firm 
would keep getting larger, the analysis of 
this paper suggests that considering the 
overall game may not lead to such a conclu- 
sion. Gale (1990b) looks at the sequential 
auction of capacity in a Cournot duopoly 
when the firms may or may not be con- 
strained. When capacity is broken into two 
units, he shows that both will go to one firm. 
When it is broken into three units, they 
need not. This example is related to a gen- 
eral result developed in Krishna (1990b) 
which shows that when the payoffs to having 
x of n units is convex in x, monopolization 
must occur. When two units are sold, the 
payoffs to having x of n units must be 
either convex or concave. They tend to be 
convex because concentration raises indus- 
try profits. When three or more units are 
sold, there is no reason for the payoff func- 
tion to be convex even if industry profits rise 
with concentration, so that monopolization 
need not obtain. An example in Krishna 
(1990b) also demonstrates that concavity of 
the payoffs does not imply equal allocations. 

This paper is organized as follows. Some 
of the basic ideas and methodology are in- 
troduced in Section I by means of a simple 
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example. The example is that of an abstract, 
two-object, sequential auction with two bid- 
ders. Even though one of the bidders has a 
higher marginal valuation for both objects 
than the other, in the unique perfect equi- 
librium of the auction, he obtains only one 
of the objects. 

Section II introduces a model of a 
monopoly with a fringe of small competitors 
and many potential entrants. All firms bid 
to acquire additional units of a specific capi- 
tal good, which is referred to as “capacity.” 
As mentioned above, the term “capacity” is 
quite flexible and could refer to licenses to 
operate, import, and so on. Two detailed 
examples in this framework illustrate some 
of the forces at work. In one, monopoly 
power dissipates, and in the other, it per- 
sists. The difference in the two examples is 
only in the relative sizes of the “lots” and 
not in the total capacity sold. 

Section III presents two results that gen- 
eralize the examples of the previous section. 
Theorem 1 shows that when a number of 
lots of equal or increasing size become 
available sequentially, all but the last lot go 
to the fringe. Thus, monopoly power dissi- 
pates. Using this result, I also study some 
conditions under which capacity divestiture 
is profitable. Theorem 2 shows that if the 
units decrease in size at a fast enough rate, 
all the units go to the monopolist so that 
monopoly persists. Both results require 
some assumptions on the shape of the mar- 
ket demand function. Linear demand func- 
tions meet these conditions. 

In the basic model it is assumed that the 
capacity-acquisition stage occurs before any 
production from that capacity takes place. 
Section IV examines the effect of relaxing 
this assumption. The timing of payoffs and 
the discount rate are shown to be crucial to 
the results. Section V contains some con- 
cluding remarks. 


I. An Illustrative Example 


In this section I present a simple example 
that illustrates some of the basic issues. 
Consider a situation in which two agents 
compete to acquire C units of a resource 
which will be auctioned off one at a time. I 
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assume that all the agents have complete 
information about all aspects of the situa- 
tion. 

Consider first the simplest case in which 
one object is offered for sale to two agents. 
In the absence of any uncertainty, the ob- 
ject goes to the agent with the higher valua- 
tion, who pays a price (essentially) equal to 
the other agent’s valuation. The exact form 
of the auction is left unspecified since the 
outcome would be the same whether the 
auction were open, a sealed-bid second- 
price auction, or a sealed-bid first-price auc- 
tion.” Next, suppose that C units are sold 
sequentially. If the marginal valuations of 
the agents are constant and one agent’s 
marginal valuation is greater than the 
other’s, all the units would be acquired by 
the agent with the higher valuation. 

What if the marginal valuations are not 
constant? Consider a situation in which two 
units are sequentially offered to two agents, 
A and B. Suppose that A assigns a value of 
10 to having one unit and a value of 20 to 
having both. B assigns a value of 9 to having 
one unit and a value of 10 to having both. 
To solve for the equilibrium outcome of this 
game it is necessary to work backwards. 
This is most easily done by means of Figure 
1, in which a tree depicting the possible 
outcomes is drawn. Each unit could go to 
either A or B. At every node, the left branch 
denotes that the unit goes to A, and the 
right branch that it goes to B. It is impor- 
tant to note that this is different from the 
usual extensive-form depiction of a game. A 
complete extensive-form description of the 
game is too complicated. The abbreviated 
outcome tree contains all the relevant infor- 
mation and is more useful. 

The end points of the tree give the pay- 
offs associated with each of the four possi- 
ble outcomes. Arrows denote the allocation 
at each stage, and prices are given in brack- 
ets next to the paths. Consider the situation 
when the first unit has gone to A. A is 


°In the first two, the outcome is a result of a 
dominant-strategy equilibrium. In a first-price sealed- 
bid auction, it is the result of the a Nash equilibrium in 
undominated strategies. 
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(20,0) (10,9) (10,9) (0,10) 


FicureE 1. DepictiInG EQUILIBRIUM USING AN 
OutTCOME TREE 


willing to pay 10 for the second unit, whereas 
for B it is worth only 9. Thus, A would 
obtain the second unit as well, paying a 
price of 9. If B had the first unit, again the 
second unit would go to A, but he would 
pay only 1 for it. How much is A willing to 
pay for the first unit? To calculate this, the 
two nodes can be replaced by their equilib- 
rium values. These are (11,0) and (9,9), 
respectively. The value to A of obtaining 
the first unit is the difference between the 
values of the two nodes and thus equals 2. 
Similarly, the value to B of obtaining the 
first unit is 9. Thus, B would get the first 
unit at a price of 2. 

Notice that, although A always gets the 
last unit because his marginal valuation ex- 
ceeds B’s, the first unit goes to B! While at 
the last stage, the marginal valuation equals 
the willingness to pay, this is not the case in 
earlier stages. This is because in earlier 
stages the willingness to pay is détermined 
by the valuations and the anticipated equi- 
librium prices in later stages. In the exam- 
ple above, A lets B obtain the first unit so as 
to get the second unit for a very low price. 
The example illustrates that, even if one 
agent’s marginal valuations are always 
greater than those of the other, he need not 
obtain all the units! This example is at the 
heart of this paper, as the standard argu- 
ments about the persistence of monopoly 
are usually made in the context of a “‘once 
and for all” availability of capacity and rely 
on the fact that this capacity is more valu- 
able to a monopolist than to an entrant. As 


(9,7) 
: 
A B 
(11,0) (9,9) 
A B A B 
(9) 
| 
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I have just shown, when capacity becomes 
available sequentially this argument is in- 
sufficient. 


II. A Model of Monopoly with a Fringe 


In this section, I outline a simple model 
of a monopoly with a competitive fringe. It 
is assumed that there is a large number of 
potential entrants who can become part of 
the fringe by obtaining capacity. The firms 
in the fringe take the price set by the 
monopolist as the given market price. I wish 
to study how additional units of capacity, 
made available sequentially, would be allo- 
cated. 

Let demand for the product be given by 
Q(P). Let P™ be the price that would be 
charged by the monopolist if there were no 
fringe, and let Q(P™) < M, where M is the 
capacity that the monopolist already has. 
Now assume that C additional units of ca- 
pacity will become available and that having 
a unit of capacity enables one unit to be 
produced at a constant marginal cost, as- 
sumed without loss of generality to be zero. 

If the monopolist obtains x units and the 
fringe obtains C — x units, the monopolist 
chooses P to maximize residual profits: 


(1) 


Let P(x) be the profit-maximizing price and 
let f(x) = (P(x), x) denote the maximized 
value of the monopolist’s profits. Note that 
P(x) rises with x assuming that 7(P, x) is 
concave in P given x. Assume P(0Q)> 0. 
Also, note that f’(x)= P(x), by the enve- 
lope theorem. These properties imply that 
the marginal value of an additional unit of 
capacity to the monopolist is positive and 
increasing. 

The standard argument for persistence of 
monopoly is easy to see from Figure 2, 
which depicts P(x). Suppose that all C units 
became available as a lump. The value of 
this capacity to the monopolist is f(C)— 
f(O). By the fundamental theorem of calcu- 
lus, this equals 


Cc 
P(t) dt 
0 


which is the area under the P(-) curve. 
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FIGURE 2. AUCTION OF CAPACITY IN Two Lots 
OF EQUAL SIZE 


The value of this capacity to an entrant, 
assuming he had none to begin with, is 
P(O)C, since if he acquired the capacity, the 
price would be P(0).° From Figure 2 it is 
clear that the value of the additional capac- 
ity and hence the willingness to pay for it is 
greater for the monopolist than for an en- 
trant. Thus, the additional capacity will go 
to the monopolist. This is the essence of the 
argument for the persistence of monopoly 
given by Gilbert and Newbery (1982), among 
others. 

To begin, consider a situation in which 
the two units of additional capacity are se- 
quentially auctioned off. Suppose the de- 
mand curve is linear, so that P(-) is also 
linear, as drawn in Figure 2. Assume that 
there are R>2 potential entrants so that 
the situation is a game with R +1 players. 


“If the entrant had any capacity, his willingness to 
pay would be less than that of a new entrant. A new 
entrant’s valuation is the value of the units in the 
equilibrium of the subgame in which he gets the units 
in question. The willingness to pay of an entrant with 
some capacity is the difference between his total payoff 
when he gets the unit(s) and when he does not. This 
must be weakly less than that of a new entrant, as the 
price charged falls as the monopolist gets fewer units, 
which reduces the value of units already held. Thus, 
existing fringe members are outbid by new entrants. 
This also ensures that no fringe-member firm grows 
large enough not to use all its capacity. 
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Suppose that the first unit goes to the 
monopolist. Then the monopolist is willing 
to pay f(2)— f() for the second unit. On 
the other hand, the entrant is willing to pay 
P(1)1 as P(\) is the price that would occur 
if the entrant got the last unit. Again, note 
that the monopolist’s willingness to pay for 
the second unit, which equals the area un- 
der the P(-) line, exceeds that of the en- 
trant, since P(-) is an increasing function. 
Hence, in this subgame, the last unit goes to 
the monopolist who pays P(1). If the first 
unit goes to an entrant, the monopolist is 
willing to pay f(1)— f(0) for the second 
unit, whereas a new entrant is willing to pay 
P(0). As the willingness to pay of a new 
entrant exceeds that of an existing fringe 
firm, only new entrants need to be consid- 
ered. This occurs because a fringe firm with 
some capacity would internalize the fail in 
price from the monopolist not getting the 
unit, while a new entrant would not (see 
footnote 6). Thus, the monopolist’s willing- 
ness to pay exceeds that of a new entrant, 
so that the last unit goes to the monopolist 
who pays P(0) for it. 

Now consider the first unit. If the 
monopolist gets it, his payoff is f(2)— P()1. 
If he does not, his payoff is f(1)— P(0)1. 
Thus his willingness to pay is [ f(2)— f(1)]—- 
[P()— P(O)]1. If an entrant gets the first 
unit, he knows that the second will go to the 
monopolist, so that the resulting price will 
be P(1). Thus, an entrant will be willing to 
pay P(1)1 for the first unit. With linear 
demand, the monopolist’s willingness to pay 
is less than that of an entrant since [ f(2)— 
fQ)— PG)|-[PQ)— <0. The first 
term equals the striped triangle of Figure 2, 
while the second is the cross-hatched rect- 
angle, which has an area twice that of the 
triangle. In equilibrium, therefore, an en- 
trant gets the first unit while the monopolist 
gets the second, and monopoly is eroded! 
The reason is the same as in the example of 
the previous section: obtaining the first unit 
makes it costly to obtain the second, and 
the monopolist is better off letting it go to 
an entrant. 

To get further intuition about this, con- 
sider the more general situation in which C 
units of capacity are sequentially auctioned 
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off in two lots. The first lot is of size x and 
the second is of size C — x.’ Once again, it 
is easy to verify that the last lot (of size 
C — x) always goes to the monopolist. If the 
first lot (of size x) went to the entrant, the 
price paid for the last lot would be P(O)C 
— x), as P(O) is the price that would prevail 
if the entrant got the last lot as well. If the 
first lot went to the monopolist, the price 
paid for the second lot would be P(xC — 
x) for similar reasons. Thus, the payoff to 
the monopolist if the first lot went to the 
entrant is f(C — x)— P(OXC — x), and the 
price in this subgame is P(C — x). The en- 
trant’s willingness to pay for the first lot is 
therefore P(C — x)x. If the first lot went to 
the monopolist, his payoff would be f(C)— 
P(xX(C — x), as the last lot always goes to 
the monopolist. 

Now look at the first stage. The differ- 
ence between the willingness to pay of the 
monopolist and that of the entrant for the 
first lot is 


(2) [f(C)-f(C-x)- P(C- x)x] 
—[P(x)- P(O)](C- x). 


The first term in brackets is positive and 
reflects the higher willingness to pay of the 
monopolist at the last stage. However, if he 
gets the first lot, he pays more for the C — x 
units in the second stage. This is captured 
by the second term. With linear demand, 
the above expression has the same sign as 
x—2(C — x). Hence, if x >2(C— x), the 
monopolist would get both lots. If, on the 
other hand, x < 2(C — x), the entrant would 
get the first lot. The intuition is that if x is 
large, the gain from having a lower price for 
the second lot is small, and the game resem- 
bles the single-stage game. In the next sec- 
tion, the relationship between the sizes of 
the lots and the persistence of monopoly is 
explored in more detail. 


’The reader may find it useful to draw the outcome 
tree corresponding to this problem. 
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Ill. Two General Results 


Suppose that C units of additional capac- 
ity are made available in n lots which are 
sold sequentially. The size of the kth lot is 
x,. Theorem 1 shows that if P(-) is concave 
and if for all k,x,<x,,,, then the 
monopolist wins the auction only in the last 
stage, so that monopoly power is dissipated. 
This is quite a strong result: despite having 
a higher marginal valuation than entrants, 
the monopolist chooses to let entrants ob- 
tain all but the last lot. Theorem 2 shows 
that if demand is linear and for all k, x, > 
2X41, where X; = L7_,x;, then monopoly 
persists. Note that this requires x, to fall 
very fast. 

The analysis is facilitated by defining a 
sequence of functions that give the mono- 
polist’s equilibrium payoffs as obtained by 
folding the outcome tree back. For k= 
f(x) to be the 
monopolist’s equilibrium payoff in the sub- 
game in which he has x units of capacity 
and the last k lots remain. By definition, 
f, Ax) = fx) = r(P(x), x). For k > 0, 
f,,-,4%) is the larger of the values from the 
two options open to the monopolist. The 
monopolist could get the (n — k +1)st lot at 
the price he would have to pay for it, which 
is the same as the market price that would 
prevail in the subgame in which an entrant 
obtained the (n—k+1)st lot. The other 
option is to let the entrant get the lot. This 
defines the function recursively. While this 
is difficult to evaluate in general, Lemmas 1 
and 2 evaluate it for two interesting cases. 


LEMMA 1: If P(-) is concave and if for all 
kK, xX, X44 4, then 


(3) = fork >1 

fork =1. 
so that by repeated application, for all k, 


(4) = + Xn) — 


(The proof is contained in the Appendix.) 
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Theorem | below is an immediate conse- 
quence of Lemma 1 and provides sufficient 
conditions for the dissipation of monopoly 
power. 


THEOREM 1: If P(-) is concave and for 
all k, x, < then in equilibrium: 


(i) the first n—1 lots go to new entrants 
while only the last lot goes to the 
monopolist; 

(ii) the price per unit of capacity paid by 
entrants equals P(x,,), while the mono- 
polist pays P(Q) per unit of capacity; and 

(iii) entrants earn zero profits ex ante while 
the monopolist earns profits of f,(x,,)— 
f,(0)— P(O)x,, >0 from participating in 
the auction. 


PROOF: 

Part (i) follows directly from Lemma 1. 
For part (ii), by Lemma 1 the equilibrium 
allocation in every subgame consists of all 
but the last unit going to an entrant. The 
monopolist pays an entrant’s willingness to 
pay, and the entrant pays the equilibrium 
price in the subgame where he obtains the 
unit, which are the prices given. Part (iii) 
follows as in every subgame entrants pay 
the full value of the object in equilibrium, 
and the monopolist only pays the entrant’s 
marginal valuation at the last stage. 


Lemma | and Theorem | can also be used 
to shed some light on a few related ques- 
tions. 

First, consider the problem of finding the 
revenue-maximizing lot size in a sequential 
auction. Although the general problem is 
rather difficult, restricting attention to 
equal-lot-size auctions makes it tractable. 
Suppose that the total capacity, C, is sold in 
n lots of size x each and also suppose that 
demand is linear. By Theorem 1, the rev- 
enue raised in a sequential auction is calcu- 
lated as follows: 


(5) R(x) = P(x)(n—1)x + P(O)x 


P(x)xn—[P(x)— P(0)]x 


P(x)C —[ P(x) — P(0)] x. 
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FiGure 3. DIVESTITURE OF CAPACITY 


Since P(-) is linear, R(x) is maximized at 
x =C/2 where C equals the total capacity 
that is to be auctioned off. 

The case of equal-sized lots was also ex- 
amined by Lewis (1983). Without loss of 
generality, assume that each lot consists of 
one unit of capacity. Lewis looks at the cost 
of acquiring k of n units and argues that 
the price the monopolist must pay to do so 
is P(k)k, as this is the value of k units if he 
succeeds in doing so. Reexamining this in 
the context of a sequential auction as above, 
one sees that if the monopolist wanted to 
get k of n units available, he would have to 
pay P(j) for the jth unit he obtained (j < k). 
This is because in the subgame in which the 
entrant gets the unit in question, only the 
last unit would go to the monopolist, as in 
Lemma 1. Thus, if the monopolist had j —1 
units and the entrant obtained the next 
unit, the price in the subgame would be 
P(j), which is the entrant’s valuation and 
the price the monopolist would have to pay. 
The monopolist always gets the last unit, 
and if he already had k —1 units he would 
have to pay only P(k —1) for this. The cost 
of acquiring k units is thus less than P(k)k. 


Lewis also argues, on the basis of looking 
at the cost of preemption alone, that di- 
vestiture of capacity may be an attractive 
proposition for a firm. His basic insight is 
that divestiture reduces the price the firm 
has to pay for preemptively acquiring any 
given amount of capacity. However, this is 
again only part of the story. Assume, as 
before, that the firm has an initial capacity 
of M. If it chooses to divest itself of m units 
of capacity it enters a subgame in which it 
has M—m units and n+ m units of capac- 
ity are available. Figure 3 serves as a guide 
to the discussion below. 

In the equilibrium of the game, assuming 
it divests, the monopolist gets only one unit 
of capacity and pays P(M — m) for it.® En- 
trants pay P(M — m +1) for each unit. Thus 
the monopolist obtains mP(M —m+1) in 
revenue from the sale of m units of capac- 
ity. Finally, as the monopolist firm’s capac- 
ity in equilibrium is M — m +1, it gets prof- 
its of f(M — m-+1). If it does not divest, it 
gets profits of f(M +1) and pays P(M) for 


’Note that P(M — m) corresponds to the monopolist 
getting none of the m+n units available. 
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the capacity it does obtain. The difference 
in the payoff from not divesting and divest- 
ing equals 


(6) [f(M +1)- f(M-m+1) 
— P(M-—m+1)m| 
—[P(M)- P(M-m)]. 


Divesting reduces the profits ex post, as 
the monopolist firm has less of the total 
capacity and so charges a lower price and 
gets lower profits. However, it also obtains 
the revenues from the sale of capacity. The 
net effect of these is given by the first term 
in brackets. This is positive as P(-) is an 
increasing function. This is evident from 
Figure 3, where it equals the striped trian- 
gle, ABC. 

However, the monopolist firm pays a 
lower price for the one unit it purchases 
when it divests. This is the second term in 
the above expression. This term equals the 
area of the cross-hatched rectangle DEGH 
in Figure 3. While the first term works 
against divestiture, the second term works 
for it. The total effect could be positive or 
negative. 

If m=1 and P(-) is concave, then the 
gain from not divesting equals 


(7) [f(M+1)- f(M)- P(M)] 
-[P(M)- P(M-1)| 
={f(M +1) f(M)- P(M+1)} 
+{[P(M+1)- P(M)] 
-[P(M)- P(M -1)]}. 


If P(-) is concave, both terms are negative, 
and divestiture is profitable. 

If P(-) is linear and m units are divested, 
the first term in the general expression above 
equals sam’, while the second equals ma 
where a is the slope of P(-). Their differ- 
ence equals ma[(m /2)—1]. For m > 2, this 
is positive so that divestiture reduces profits 
if more than two units are divested; thus the 
potential for raising profits by divestiture is 
relatively limited. 
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While Theorem 1 and its applications all 
concern the dissipation of monopoly power, 
I now turn to an examination of some con- 
ditions under which monopoly would per- 
sist. For the two-stage example in the previ- 
ous section, I showed that if x > 2(C — x) 
the monopolist obtained both lots. For all 
units to go to the monopolist in the n-stage 
game, a generalization of this condition is 
needed. This is the content of Lemma 2 and 
Theorem 2. 

The relationship between f,_,(x) and 


f,{«) that would result if the monopolist 


were to obtain all additional units in every 
subgame is derived in Lemma 2. Theorem 2 
uses this relationship to determine the equi- 
librium outcomes. 


LEMMA 2: /f P is linear and, for all k, 


n 
Xp where X, = x; 


i=k 


then for all k, 


k 


i=2 


(See the Appendix for the proof.) 


THEOREM 2: If P(-) is linear and for all 
k, x, > 2X,.,, then in equilibrium: 


(i) all units go to the monopolist; 
(ii) the price per unit paid in stage k equals 
P(X, — x,); and 
(iii) the monopolist’s profits from participat- 
ing in the auction, f(0)— f,(0), are posi- 
five. 


PROOF: 

Parts (i) and (ii) follow from Lemma 2. 
Part (iii) then follows from Lemma 2 and 
the fact that at any stage, and following any 
history, it is profitable for the monopolist to 
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obtain the current lot. Hence, 


(9) fo(0) > fi(0) > f,(0) >... > 


While the model above has been formu- 
lated in terms of capacity acquisition, it 
affords another interpretation. Consider the 
problem of a monopolist who operates in 
more than one country. It is convenient to 
think of the monopolist as being foreign and 
operating in the home market and in an- 
other market. Arbitrage forces him to charge 
one price for his product. Let O( P) denote 
the demand at home. If he charges P he 
gets O(P)P revenue from the home market 
and r(P) from his other markets. Suppose 
that the home country imposes a quota at 
the level of imports denoted by V. If the 
monopolist charges P, then the price of a 
license to import one unit is L(P,V ), where 
L(-) is defined by Q(P + L)=V. Now sup- 
pose that these licenses are auctioned to the 
monopolist or to a large number of compet- 
itive domestic retailers.’ If the monopolist 
gets x of these licenses, he chooses a price 
to maximize: 


(10) w(P,x,V)=PO(P+L(-)) 
+r(P)+2L(-). 


Denote this price by P(x,V) and let 
a(P(x,V),x,V)= f(x,V). Note that by the 
envelope theorem f’(x,V)= L(P(x,V),V), 
which also equals a competitive retailer’s 
valuation of a license.'® For fixed V, this 
model has the same structure as the model 
of monopoly with a fringe. 


IV. The Importance of Timing 


The model studied implicitly assumes that 
the acquisition stage occurs before any pro- 
duction takes place. Thus, the acquisition 


If there are only a few retailers, then the model 
would not apply, as the marginal valuation of the 
retailers would lie below that of an entrant and be less 
than L(-) because retailers would take into account 
how getting more licenses would reduce the value of 
the licenses they already own. 

Generally L(-) is positive, as assumed here, if the 
quota is relatively restrictive. 
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stage is assumed to be brief compared to 
the life of the industry. This may be more 
appropriate in some settings (e.g., the quota 
license interpretation offered above) than 
others. In this section, I examine the effects 
of relaxing this assumption. 

The main issues can be seen by amending 
the simple two-stage example of Section II. 
As before, in the presence of a competitive 
fringe the monopolist is assumed to have 
enough capacity to produce whatever he 
chooses. As before, in each period one unit 
of capacity becomes available and can be 
acquired by either the monopolist or an 
entrant. However, production is now as- 
sumed to take place in each of the two 
periods. Thus, the value of winning the first 
unit of capacity now includes the profits 
from production in the first period. 

Let A,, be the second-period profits ac- 
cruing to the monopolist when he has both 
units of additional capacity. Similarly, let 
A,, be his second-period profits from having 
the first but not the second, let Ay, be the 
profits from having the second but not the 
first, and let Ay, be the result of having 
neither. Having obtained the first unit, the 
monopolist gets f(2) in the second period if 
he obtains the second unit. Hence, A,,= 
f(2). If he does not obtain the second unit, 
he only gets f(1), so = fC). Similarly, 
Avy, = fQ), and = f(0). 

The entrant’s bid for the second unit of 
capacity when the monopolist gets the first 
unit is P(1), and the bid is P(O) when the 
monopolist does not. It is easy to see that, 
as in the previous sections, the monopolist 
gets the last unit in both subgames. 

However, incentives in the first stage are 
now different. Let A, denote the mono- 
polist’s payoff evaluated at the beginning of 
the first stage when he gets the first unit. 
Let A, be similarly defined as his profits 
when he does not get the first unit. If 
the monopolist gets the first unit, he gets 
f(2) from production in the first period 
(as no capacity has gone to the fringe) and 
6 f(2)— P()] in present-value terms from 
the subgame, where 6 = 1/(1 +r) is the dis- 
count factor. If he does not get the first unit 
he gets f(1) from production in the first 
period (as one unit of capacity goes to the 


3 
| 
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fringe) and 6{ f(1)— P(O)] in present-value 
terms from the second period. An entrant is 
willing to pay P(11+ 6) for this capacity, 
as the monopolist gets the second unit, and 
the price is P(1) throughout. The difference 
between the willingness to pay of the 
monopolist and the entrant’s bid equals 


(11) f(2)-f()- PQ) 
+ 8{f(2)— f(1) PC) 
—[P(1) P(0)]}. 
In the linear case, this equals 


(12) (1-d5)a/2>0 
where, as before, a is the slope of the P(-) 
function. 

Hence, the monopolist gets both units! It 
is easy to verify that, if production (but not 
capacity acquisition) continues after the sec- 
ond period, the last unit would go to the 
monopolist. However, if r <1, the first unit 
would go to the entrant.!! 

When production occurs before the ac- 
quisition stage ends, acquiring capacity to- 
day raises profits today as well as raising the 
price of capacity acquisition in the future. 
The former works in favor of monopoly 
persisting, while the latter works against it. 
If r is small, the latter predominates, and 
monopoly power dissipates; while if r is 
large, the former predominates, and mono- 
poly persists. 


V. Conclusion 


I have argued that the sequential nature 
of the competition to acquire resources can 
have a powerful effect on their resulting 
allocation. In particular, even though large 
incumbent firms attach greater value to 
these resources than new entrants, some of 
these additional resources may go to en- 


‘The last unit goes to the monopolist as before. 
The first unit also goes to the monopolist if the 
monopolist’s willingness to pay at the first stage ex- 
ceeds that of the entrant. The difference in the two has 
the same sign as [1—(1/r)]a /2. 
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trants. The basic insight is that the prices of 
later units rise if earlier units are acquired. 

If the capacity-acquisition stage is short 
and unit size rises weakly, then monopoly 
dissipates. This also occurs if the capacity- 
acquisition stage is long but the future mat- 
ters a lot, either because the interest rate is 
low or because the life of the industry is 
long relative to the acquisition stage. On the 
other hand, if the capacity-acquisition stage 
is short, preemption occurs if the unit size 
falls sharply enough. If the capacity-acquisi- 
tion stage is long relative to the life of the 
industry, preemption occurs if the future 
matters little, either because of discounting 
or because production ends with period 2. 
Thus, the issue of the persistence of 
monopoly depends critically on the se- 
quence of lot size, the timing of capacity 
acquisition and production, and on the dis- 
count factor. 


APPENDIX 


LEMMA 1: If P(-) is linear or concave and 
if for all k, x; 4, then 


= + P(x)x,, 


fork =1 


(Al) f,-,(x)= fork >1 


so that by repeated application, for all k, 
(A2) = + — P(X) 


PROOF: 
For k = 1, 


+ 


(A3) f,-(x)= max{ 


= fi¢e+ x,)— P(x 


For k>1, the proof is by induction. For 


max{ + X,_1) 
AG 


| 

(A4)  f,-2(%) 
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where A(x,n — k) is the allocation, that is, 
the amount of new capacity obtained by the 
monopolist, in the subgame at the (n — k)th 
stage when the monopolist already has x 
units. Moreover, 


ini + x,)— P(x)x, 
(A5) fn—2(*) = max} + 2,4 +e 


since A(x,n—1)= x,,, as the monopolist al- 
ways wins in the last stage, and as Lemma 1 
holds for k =1. The lower term, L, is less 
than the upper one, U, as their difference 
L-U is 


(A6) ffx +2,) 
—P(x+ Xn-i1t 


P(x +x,-,+ x,)— P(x+X,) 


Xn-1 


2 


Xn-1 


x 


[P(x P(x+x,) 
+ Xn-1¥n 
Xn-1 
P(x + x,-1)— P(x) 
Xn-1%Xn 
Xn-1 


The first inequality occurs because x,,_, < 
X,- The second occurs because the first term 
is negative as P(-) is upward-sloping and 
the difference between the second and third 
terms is nonpositive if P(-) is concave. 
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Finally, assuming that the result holds for 
k, I show that it holds for k +1. Again, 


(A7) 


= max + Xn—k) 
| = P(x PAG 


As the result holds for k by assumption, 
A(n—k,x)=x,, and using the result for k 
repeatedly gives 


(A8) 


+ Xn)— P(x) Xp 


= max + Xn) 


P(x + 
Again, L — U equals 


n 


Xn-k 


+X,-4)- 
XnXn—-k 
Xn-k 


{f(x + f(x + x,) 


Xn-k + Xn)Xp 


Xn-k*n 
Xn—-k 


=] Xn-k*¥n 


<0. 


The first inequality holds as x,,_, < x,,. The 
second holds as the first term is negative 
and the difference between the last two is 
nonpositive since P(-) is concave. Thus, the 
induction step holds. 


3 
| 
| 
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LEMMA 2: [If P is linear and, for all k, 
> 
where X,, = L'_,.X; then for all k, 


k 
P{x+ 
i=2 


\ 


PROOF: 

The last unit always goes to the 
monopolist so that this holds for k = 1 since 
for k>n, x,=0. Thus, A(x,n—-1)=x,,. 
When k =2, 

(All) 


= max fy Xn 


+ P( x )X, 


(x+x,_,+x,) 
= max n 
=F 


The difference between the lower and up- 
per terms, L — U, is given by 


x,)— P(x + x,)%,-1) 
—[P(x+x,_,)— P(x)]x,. 


As P(-) is linear with slope a, 


(Al3) L-U=F(Xq-1) AX 


> 0 


as by assumption x,,_ , > 2x,,. Thus, f,,_ 
= P(x +x,)x as pos- 
tulated. 


n—-1 n-1 
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Now I show that, if the result holds for k, 
it holds for k +1. Note first that 


n 


k 
(Al4) 


i=n—k+2 


X; = X,_442- 


As the result holds for k by assumption, 


(A15) 


f(x + Xn-k+1) 


k 


x Xn—-K+il- 


Similarly, 


(Al16)  f, 


KA x+ Xn—k ) 


k 
(P(e + x, 
i=1 


+ +1— *n-k +i) Xn—k +il- 


Thus, 


(A117) 


(k+ i(*) 


fr g(x) 


Ax 
k 
i=1 
+ 
i=1 
P(x+ 


Xn—-k)Xn-k: 


The lower term less the upper one is de- 


2 
| 
| 
— 
| 


noted by L — U, and 


(A18) L-U 
= {f(x + X,-4)— + 
k 
i=1 
k+i° 


As P(-) is linear, 


(A19) L- Ua iy 
AX, 
>0 
as, for all k, x,_, >2X,_ 


Thus, the result holds for i: +1 if it holds 
for k. 
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A Structural Model of Peak-Period Congestion: 
A Traffic Bottleneck with Elastic Demand 


By RIcHARD ARNOTT, ANDRE DE PALMA, AND RosIN LINDSEY* 


This paper considers the modeling of road congestion subject to peak-load 
demand. The standard model contains ambiguities and is poorly specified. These 
problems can be eliminated by working with a structural model that explicitly 
treats the congestion technology and drivers’ behavioral decisions. The paper 
provides a detailed analysis of a particular structural model — William Vickrey’s 
model of bottleneck congestion in the morning rush-hour auto commute, 
extended to treat elastic (i.e., price-sensitive) demand—and examines some 
economic implications of the structural approach. (JEL R41) 


This paper considers the modeling of road 
congestion subject to peak-load demand. It 
argues that a properly specified model 
should be structural, that is, be derived ex- 
plicitly treating the congestion technology 
and consumers’ behavioral decisions. 

The standard model of a facility subject 
to peak-period congestion is specified as 
follows (e.g., Dennis J. Aigner and Joseph 
G. Hirschberg, 1985; Ronald R. Braeutigam, 
1989).! The period of use is divided into 


*Arnott: Department of Economics, Boston College, 
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ment of Civil Engineering, Northwestern Univer- 
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This paper was redrafted while the authors were visit- 
ing GREQE, Marseilles; we thank the group there for 
its hospitality. 

Early works on peak-load pricing include Jules 
Dupuit (1849), Arthur C. Pigou (1920), and Frank 
Knight (1924); all of these authors recognized the ap- 
plication of the principle of marginal-cost pricing to 
traffic congestion. Major contributions to the modern 
theory of peak-load pricing include Marcel Boiteux 
(1949), Peter O. Steiner (1957), Oliver E. Williamson 
(1966), and Michael A. Crew and Paul R. Kleindorfer 
(1986). Important papers applying the theory to urban 
transportation include Alan A. Walters (1961), Herbert 
D. Mohring and Mitchell Harwitz (1962), Robert H. 
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finite time intervals. A separate analysis is 
conducted in price—quantity space for each 
of these intervals. The quantity is the num- 
ber of facility users. Price is the sum of the 
user’s toll or fee and his private cost (net of 
the toll, but incorporating congestion). 
Marginal social cost is the increase in social 
cost in that interval from an additional user 
in that interval. Both private and marginal 
social cost are viewed as depending on the 
number of users and capacity and as being 
determined by the technology. The position 
of the demand curve depends on both the 
time interval and the equilibrium prices for 
the other time intervals (because of inter- 
temporal substitution). Equilibrium is at the 
point of intersection of the demand and 
supply (relating price to the number 
of facility users) curves, and the social 
optimum at the point of intersection of 
the demand (marginal-social-benefit) and 
marginal-social-cost curves. 

This specification contains a number of 
ambiguities. To illustrate, consider a styl- 
ized model of traffic in the morning rush 
hour in which congestion occurs at only a 
single bottleneck, the entry point to the 
central business district. This bottleneck has 
fixed capacity, and if the number of drivers 
arriving at the bottleneck exceeds this ca- 
pacity a queue forms. Even for this simple 


Strotz (1965), Marvin C. Kraus et al. (1976), and 
Theodore E. Keeler and Kenneth A. Small (1977). 


— 
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example, there are several respects in which 
it is not clear how to apply the standard 
model. First, what is the appropriate mea- 
sure of the number of users in a time inter- 
val? Is it the number of drivers joining the 
queue or the number passing through the 
bottleneck? This ambiguity stems from a 
failure to model the congestion technology 
explicitly. Second, the private cost in an 
interval depends not only on the number of 
users in that interval, but also on queue 
length, which reflects congestion in previous 
intervals. This illustrates a point that trans- 
port engineers have stressed: that conges- 
tion is inherently a dynamic phenomenon. 
Third, and relatedly, the addition of a driver 
in an interval increases the queue length 
and hence social cost not just in that inter- 
val, but in other intervals as well. This list of 
ambiguities suggests that the standard model 
of peak-load congestion in which the period 
of use is divided into intervals is poorly 
specified. 

In recent years a new structural model of 
peak-period congestion that does not con- 
tain these ambiguities has been extensively 
developed. The main innovation is to treat 
explicitly the user’s behavioral decisions. 
The literature has focused on the time-of- 
use decision, whereby the user trades off 
the cost of using the facility at an inconve- 
nient time against the congestion cost of 
using the facility when it is crowded. The 
structural approach also provides an explicit 
treatment of the congestion technology. 

The seminal paper is one by William 
Vickrey (1969), and the example we gave 
above is based on his model. He examined 
equilibrium with a fixed number of identical 
drivers on a point-input, point-output road 
in the morning rush hour, along which there 
is a single bottleneck of fixed capacity. If 
the arrival rate at the bottleneck exceeds 
capacity, a queue develops. All drivers wish 
to arrive at work at the same time. (Actu- 
ally, Vickrey [1969] considered a situation 
with a distribution of work start times, but 
the bulk of the literature assumes a com- 
mon start time.) This is physically impossi- 
ble. As a result, each driver faces a trade-off 
in deciding when to leave home. If she 
leaves early, she faces no queue but arrives 
at work inconveniently early; if she leaves so 


MARCH 1993 


as to arrive on time, she faces a long queue; 
and if she leaves late, she faces no queue, 
but arrives inconveniently late. Equilibrium 
obtains when the queue length over time is 
such that no driver can reduce her trip price 
by changing her departure time. Vickrey not 
only solved for the no-toll equilibrium, but 
also determined the social optimum and the 
time-varying toll which decentralizes it and 
made some astute observations concerning 
optimal capacity and equilibrium with two 
parallel roads. 

The Vickrey (1969) model has been elab- 
orated in a large number of papers in trans- 
portation science (e.g., Chris Hendrickson 
and George Kocur, 1981), but with the ex- 
ception of papers by the present authors 
(e.g., de Palma and Arnott, 1986; Arnott 
et al., 1989, 1990; Arnott and Kraus, 1990) 
and by Ralph M. Braid (1989) and Yuval 
Cohen (1987), it has been ignored by 
economists.” This is unfortunate since the 
rich economic implications of the model 
have not received the attention they merit. 
In this paper, we extend the model, focusing 
on its economics. Our paper makes four 
contributions to the literature on peak- 
period congestion. 

First, the paper illustrates the power of 
the structural approach in analyzing alter- 
native pricing regimes and provides the first 
treatment of elastic (i.e., price-sensitive) trip 
demand in the context of the Vickrey (1969) 
bottleneck model (but see footnote 2). Equi- 
librium is characterized for four pricing 
regimes (in order of increasing sensitivity: 
no toll, the optimal uniform toll, the opti- 
mal step [‘‘coarse’’] toll, and the optimal 
time-varying [‘‘fine”] toll) with capacity ex- 
ogenous and then with capacity chosen opti- 
mally. The four regimes are then compared. 


"Arnott et al. (1990) formalizes Vickrey’s (1969) 
model, retaining his assumption of inelastic trip de- 
mand, and extends it to treat a coarse toll and to solve 
for optimal capacity. Cohen (1987) and Arnott et al. 
(1989) extend the Vickrey model to allow for driver 
heterogeneity. Braid (1989), who wrote his paper inde- 
pendently of ours, compares the social optimum and 
competitive equilibrium when trip demand is elastic 
and capacity fixed, with a general trip-cost function. 
Finally, Arnott and Kraus (1990) examines Ramsey 
pricing, employing the bottleneck model. 
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Among other results, we show that, with 
demand elasticity less than unity, optimal 
capacity is larger the less sensitive is the 
(optimal) toll. 

Second, the paper provides insight into 
the appropriate use and interpretation of 
the standard model of a congestible facility 
with peak-period congestion. In the Vickrey 
(1969) model the departure rate from home 
over the rush-hour period adjusts such that 
trip price is constant over the period. Equi- 
librium is therefore determined simultane- 
ously over the entire period, and it makes 
no sense to compute equilibrium over only 
part of the period. Thus, the standard pro- 
cedure of dividing up the period of use into 
intervals and solving separately for equilib- 
rium in each interval is conceptually un- 
sound. We show, however, that the standard 
model can be interpreted as providing a 
reduced-form representation of the struc- 
tural Vickrey model (since price and the 
number of users are endogenous, “semi- 
reduced” form is perhaps more appropriate 
terminology), as long as the entire rush-hour 
period is treated as a single interval. 

Third, the results of the paper reinforce 
those in Arnott et al. (1990) in suggesting 
that the gains from the efficient pricing of 
road congestion may be considerably greater 
than those estimated in previous studies 
(reviewed in Arnott et al. [1990]). Pricing 
influences an individual’s choice of home 
location, lot size, trip frequency, mode and 
route choice, and departure time. Also, by 
influencing land rents, the form and level of 
urban transport pricing affects urban spatial 
structure. The literature in urban economics 
has treated the rush hour as a period of 
fixed length with uniform congestion. In do- 
ing so, it has been able to incorporate all 
the margins of adjustment noted above ex- 
cept individuals’ departure-time decisions. 
That this paper and Arnott et al. (1990) 
obtain efficiency gains from road tolling that 
are substantially greater than those previ- 
ously estimated suggests that a considerable 
fraction of the gains result from the change 
induced in the time pattern of road usage 
over the peak period. This in turn suggests 
that more serious consideration should be 
given to pricing schemes in urban transport 
that smooth the peak; it also suggests that 
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the benefits from employing technologically 
sophisticated pricing schemes for urban 
roads, of the sort advocated for many years 
by Vickrey (1963, 1971) and discussed in 
Sanford F. Borins (1986) and Kraus (1989), 
may have been substantially underesti- 
mated. 

Fourth, the paper discusses the degree to 
which roads should be self-financing. 
Mohring and Harwitz (1962) and Strotz 
(1965) (MHS hereafter) proved that if the 
total cost associated with a road (private 
costs plus capacity cost) doubles when both 
capacity and usage double, then revenue 
from the optimal toll equals the cost of 
constructing and maintaining optimal capac- 
ity, so that the road should be self-financ- 
ing. MHS also examined the cases in which 
total cost is homogeneous of degree greater 
and less than one in capacity and usage.° 
We show that the MHS results extend to 
the basic bottleneck model independently of 
the tolling regime, as long as the toll is set 
optimally conditional on the regime. This 
result has important implications. For ex- 
ample, if toll collection on roads is pro- 
hibitively expensive and a gasoline tax is the 
only policy instrument available to the gov- 
ernment which affects the price of a trip, 
then with optimal capacity (conditional on a 
gasoline tax) the tax rate should be raised to 
the point at which revenues from the tax 
satisfy the MHS conditions. 

We shall develop our analysis using the 
Vickrey (1969) model with identical individ- 
uals. Subsequently, we shall discuss briefly 
how our results extend to heterogeneous 
individuals. Finally, we shall comment on 
how the structural approach to modeling 
urban road congestion can be adapted to 
develop structural models of other con- 
gestible facilities. 

Section I examines the economics of the 
basic Vickrey (1969) model of road conges- 
tion. Equilibria with elastic trip demand and 
fixed capacity for the four pricing regimes 
are solved for and compared in Section II. 


*Mohring and Harwitz (1962) and Strotz (1965) de- 
veloped their models specifically in the context of 
transport congestion. Williamson (1966) derived the 
same result in a more specific model, but with a broader 
context. 
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The analysis is extended to treat optimal 
capacity in Section III. Section IV investi- 
gates the self-financing results in this con- 
text. A numerical example is presented in 
Section V. Generalizations are discussed in 
Section VI, and concluding comments are 
given in Section VII. 


I. Review of the Basic Model: Fixed 
Number of Drivers 


A. The Characteristics of Demand and 
the Congestion Technology 


Every morning N identical individuals 
travel from home to work along the same 
road, each in her own car. Travel is uncon- 
gested except at a single bottleneck through 
which at most s cars can pass per unit time; 
if the arrival rate at the bottleneck exceeds 
s, a queue develops. Thus, the capacity con- 
straint is a flow constraint, while the queue 
discipline is first-come, first-served (FIFO). 

Travel time from home to work is 


(1) T(t) =T'+T*(t) 


where T' is fixed travel time, TY is variable 
travel time, and ¢ is departure time from 
home. Without affecting results of interest, 
we set 7‘ =0; thus an individual arrives at 
the bottleneck as soon as she leaves home 
and arrives at work immediately upon leav- 
ing the bottleneck. Let D(t) be the queue 
length (i.e., number of cars). Then, an indi- 
vidual’s queuing time equals queue length 
at the time she joins the queue divided by 
bottleneck capacity: 


(2) rye. 


Let ¢ denote the most recent time at which 


there was no queue and let r(t) be the 
departure rate function (from home). Then, 


(3) D(t) = f'r(u) du— s(t 


The price of a trip is the sum of private 
cost and the toll: 


(4) p(t)=C(t)+ F(t). 
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Private cost, in turn, is taken to be linear in 
travel time and schedule delay (time early 
or time late): 


(5) C(t)=aT’(t) + B(time early) 


+ y(time late) 


where a is the shadow cost of travel time, B 
is the unit cost of arriving early at work, and 
y is the unit cost of arriving late. In accor- 
dance with empirical results (Small, 1982), 
we assume that y>a> B. Following con- 
vention, we call the cost of arriving at work 
early or late “schedule delay cost” and call 
T ‘(t) “travel time cost.” The desired arrival 
time is t*. Thus, “time early” is max{0, ¢* — 
t—T*(t)] and “time late” is max[0,¢+ 
T(t)— 

Each individual decides when to leave 
home. In doing so, she trades off travel 
time, schedule delay, and the toll. Equilib- 
rium obtains when no individual can de- 
crease her trip price by altering her depar- 
ture time, taking all other drivers’ departure 
times as fixed. Thus, the equilibrium is a 
pure-strategy Nash equilibrium with depar- 
ture times as the strategy variables. This 
seems to be the natural equilibrium concept 
to employ in this context.* 


B. The No-Toll Equilibrium 


Let t, be the beginning of the rush hour, 
let ty be the end, and let ¢ be the departure 
time for on-time arrival [f = — T‘(7)]. For 
early arrival [t €[t,,7)], the equal-trip-price 
condition is 


(6) p(t)=p=aT*(t)+Blt* —t-T(t)). 


“In this context, pure strategies appear to be more 
realistic than mixed strategies, since most individuals 
prefer a routine—to leave home at the same time 
every day. The mixed-strategy equilibrium is the same 
if aggregate stochasticity is ignored. Moshe Ben Akiva 
et al. (1986) demonstrate convergence to the no-toll 
Nash equilibrium via a specific adjustment process in 
which, out of equilibrium, individuals adjust their de- 
parture times according to a particular probabilistic 
decision rule. 


3 


VOL. 83 NO. 1 


Differentiation of (6) yields 


aT‘(t) 


) 


From (3), 


(8) 


— 


dt 
Combining (2), (7), and (8) gives 


as 


(9) r(t)= 


for 


By a similar argument, it can be shown that 


as 


(10) r(t)= 


for 
Thus, the queue length evolves over the 
rush hour such that the equal-trip-price 
condition is satisfied. 

The individuals who depart at the begin- 
ning and end of the rush hour incur only 
schedule delay cost, which must be equal in 
equilibrium. Since arrivals are continuous 
over the rush hour, the length of the rush 
hour is N/s. These results together imply 


(11a) 


(11b) 


Furthermore, it is easily shown that 


N 
(12) 
s 


The solution is depicted in Figure 1. The 
vertical distance between the cumulative 
departures schedule and the cumulative ar- 
rivals schedule is queue length, and the 
horizontal distance is travel time [ D(t’) and 
T ‘(t'), respectively, in the figure]. The queue 
builds up linearly from ¢, to ¢, and then 
dissipates linearly until it disappears at f,,. 
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Cumulative 


departures 


Cumulative 
arrivals 


Cumulative departures and arrivals 


FiGuRE |. THe No-Toit Equitiprium 


Let TTC be total travel time cost, let 
SDC be total schedule delay cost, and let 
TC be total travel cost. In Figure 1, TTC is 
a times area ABCA, and SDC is B times 
area AFGA plus y times area CFHC. It is 
straightforward to show that 


2 


(13) TTC*(N,s) = 
AY 


N2 


where 


(14) By 
Bry 


and superscript e denotes the no-toll equi- 
librium.° 

Since there is no toll, price equals private 
cost, which is the same for everyone and 
equals average total travel cost, ATC = 


Note that all these aggregates are independent of 
a. The start and end of the rush hour are independent 
of a, and therefore, so is total schedule delay cost. 
Since TTCS = SDC‘, TTC* and TC* too are indepen- 
dent of a. A doubling of a@ halves queue length, 
preserving travel time cost. 


| 
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TC/N;; that is, 


We may calculate marginal social cost 


(MSC) from (13): 


orc’ 2aN 
(16) MSC* = = 
ON S 
=2 = 2ATC*. 


Note that marginal social cost is indepen- 


dent of departure time. The reason is that 


other drivers adjust their departure times in 


such a way that the departure-rate function 


(including the additional traveler) is inde- 
pendent of the additional traveler’s depar- 


ture time. 


Several observations are in order: 


1. In the no-toll equilibrium, total schedule 
delay cost equals total travel time cost. 
The model highlights the costs of travel- 
ing at inconvenient times, which are hid- 
den in the standard model. 
. Equation (15), after substitution of (13) 
and (14), characterizes price as a func- 
tion of N and s. The resulting function is 
therefore the reduced-form supply func- 
tion, to which a trip demand function 
may be added to determine the full equi- 
librium. The supply function captures the 
technology, the pricing regime consid- 
ered, and commuters’ time-of-use deci- 
sions. The demand function captures 
commuters’ frequency-of-use decisions. 

. Downs’s law (Anthony Downs, 1962), 
which is based on observation, states that 
an expansion of road capacity will reduce 
the length of the peak period more than 
the level of congestion at the peak. In 
the bottleneck model with inelastic de- 
mand, a doubling of capacity halves the 
length of the rush hour and the maxi- 
mum travel time but leaves the maximum 
queue length unaltered. 

. Queuing time in the model is pure dead- 
weight loss. If the departure rate were 
set at s between ¢, and f,, queuing 
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would be eliminated. Furthermore, since 
the time pattern of arrivals would be the 
same as in the no-toll equilibrium, total 
schedule delay cost would be unchanged. 
Thus, the social saving from the change 
would equal total travel time cost in the 
no-toll equilibrium, which equals one-half 
of total travel cost. This example pro- 
vides a stark illustration of the potential 
social saving that can be achieved through 
altering the time pattern of departures, 
even when the total number of trips and 
the distribution of arrival times are held 
constant. 


. Private cost and marginal social cost are 


determined by the pattern of congestion 
over the entire period of use, and not, as 
is assumed in the standard analysis, by 
the number of users over some portion 
of the period. 


. Marginal social cost should be computed 


mutatis mutandis, not ceteris paribus; 
that is, marginal social cost is computed 
incorrectly if one adds a commuter and 
computes the increase in social cost from 
doing so, without allowing other drivers 
to adjust their departure times. The rea- 
son why computing marginal social cost 
ceteris paribus is incorrect is that the 
envelope theorem does not hold. Be- 
cause there is no toll, prices are dis- 
torted. Consequently, the adjustments 
that commuters make in their departure 
times in response to the added driver 
alter the deadweight loss from unpriced 
congestion. 


. We shall see later that a change in the 


form of pricing alters the private and 
marginal social cost functions. Thus, con- 
trary to the standard model, the private 
and marginal social cost functions are 
not completely technologically deter- 
mined, but depend as well on the form of 
pricing. This is because the cost func- 
tions capture consumers’ time-of-use de- 
cisions, which depend on the form of 
pricing, even with N and s fixed. 


II. Elastic Demand, Capacity Arbitrary 


In the previous section, we characterized 


the reduced-form supply function in the ab- 


|__| 
N 
) 
6 
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sence of a toll. We now incorporate variable 
trip demand. 
The demand for trips is assumed to be 


dN 


(17) N=N(p) —<0O. 


The consumers’ surplus from travel with 
price p is 


CS(p) = do’ 


Dp 


(18) 


while the social surplus (gross of capacity 
costs) from travel, with price p and average 
toll 7, is the sum of consumers’ surplus and 
toll revenues: 


(19)  SS(p,r) =CS( p) + Nr. 


A. The No-Toll Equilibrium 


From (13) and (15), 
(20) 


which is the supply function for trips in the 
absence of a toll. Solving this simultane- 
ously with the demand function (17) yields 
the equilibrium trip price and number of 
trips, p*(s) and N“(s), where a “*” over a 
variable indicates that it is a function of 
only s (plus exogenous parameters). We also 
have 


(21) CS (s) (s)= f N(p') dp 
p(s 


We now derive the reduced-form supply 
functions for the three toll regimes, as was 
done in the previous section for the case of 
no toll. Subsequently, we shall characterize 
equilibrium for the various toll regimes and 
then compare the four equilibria. 


B. Alternative Tolling Regimes 


We shall consider three different tolling 
regimes: a uniform toll, a fine toll, and a 
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coarse toll. The uniform toll is constant 
throughout the day. It does not alter the 
pattern of congestion over the rush hour, 
but with elastic trip demand it causes a 
reduction in the number of trips. The fine 
toll is a completely flexible time-dependent 
toll. The coarse toll is intermediate between 
the uniform and fine tolls. It combines a 
lower off-peak toll with a higher peak toll 
and is characterized by four parameters: the 
magnitudes of the peak and off-peak tolls, 
as well as the times at which the peak toll is 
applied and later removed. The average 
level of the coarse toll discourages travel, 
and the toll differential between the peak 
and off-peak periods shifts traffic from the 
peak to the off-peak period. 

For each toll, we assume throughout the 
paper that the parameters of the toll are set 
optimally. The derivations of the optimal 
tolls are presented in Arnott et al. (1990); 
here we record the results and provide some 
intuition for them. 

Uniform Toll.—Since the uniform toll 
adds a constant fee to each trip, for fixed N 
it does not alter the departure pattern [re- 
call the derivations of (9) and (10)]. Thus, 
total travel cost is related to N and s in the 
same way as in the no-toll equilibrium; that 
is, 


(22) 


where superscript u denotes the uniform 
toll. Thus, 
(23) MSC" = 2ATC’. 

Since price and the toll are the same for all 
commuters, so too is private cost. Also, av- 


erage travel cost equals (average) private 
cost. Thus, 


(24) +7". 


For efficiency, the price of a trip equals 
marginal social cost. Hence, using (23) and 
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(24), one obtains 
(25) ATC" p" = MSC" =2ATC" 
ATC. 


Thus, the optimal toll equals average travel 
cost. 

Fine Toll.—The social optimum can be 
decentralized with a time-varying toll. The 
social optimum entails no queue and there- 
fore zero total travel time cost. Also, for a 
given number of travelers, the beginning 
and end of the rush hour are the same as in 
the no-toll equilibrium. Thus, SDC°(N,s) 
while TTC°(N,s)=0, so 
that, from (13), 


5 N? 
(26)  TC°=SDC°= 
ys Ss 


where a superscript o denotes the social 
optimum. Efficiency requires that each trav- 
eler pays marginal social cost which equals 
twice the average travel cost. In the absence 
of a queue the person who arrives on time 
faces zero travel cost and for efficiency must 
therefore pay a toll equal to twice the aver- 
age travel cost. Furthermore, the persons 
who depart first and last incur twice the 
average schedule delay and hence twice the 
average travel cost and should therefore pay 
no toll. Finally, because of the linearity of 
(5), the fine toll increases linearly from ¢, to 
t* and then decreases linearly from t* to 
ty. The average toll equals average travel 
cost. 

Coarse Toll.—For this toll, too, the aver- 
age toll paid equals average travel cost. This 
toll is more efficient than a uniform toll 
since it alters the queuing pattern over the 
rush hour, but since it does not completely 
eliminate queuing, it is not as efficient as 
the fine toll. It is shown in Arnott et al. 
(1990) that, with application of the coarse 
toll, 


N?2 
4 (B+y)(at+B)] s 
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where superscript c denotes the coarse toll.° 
We collect the above results in the following 
proposition. 


PROPOSITION 1: Let j=e,u,c,o index 
the pricing regime. Then, 


(28a) TC/(N,s)= 
(28b) ATC’(N,s)= 
(28c) MSC/(N,s) =2ATC/(N,s) 
where 


(29a) re=T'=1 


(29b) Po = 
(29c) TS=—]|3- : 
4 (Bt+y)(at+y) 
Also, 
(30a) p*(N,s)=ATC*(N,s). 


Furthermore, for j = u,c,0, where t/ denotes 
the corresponding average toll, 


(30b) p/(N,s)= ATC’(N,s)+7/(N,5) 
and 


(31) p*(N,s)=MSC/(N,s) =2ATC/(N,s) 


°The (optimal) coarse toll has the following charac- 
teristics. It is applied at the front of the queue. The 
peak toll is turned on before t* and off after *. In the 
early morning, well before the peak toll is applied, a 
queue builds up. Then, for a period of time immedi- 
ately before the peak toll is applied, there are no 
departures, and the queue length decreases to zero at 
the moment the peak toll takes effect. The queue then 
rises until ¢* and falls after ¢* until it reaches zero just 
before the peak toll is lifted. Right after the peak toll is 
lifted, there is a mass of departures and no departures 
subsequently. 


| 
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which together imply 
(32) 7/(N,s)=ATC!(N,s) 
MSC/(N,s)  p’(N,s) 


4 


With the tolls set at their optimal levels, 
commuters pay the marginal social cost of 
each trip. Thus, the marginal-social-cost 
functions for the three toll regimes given in 
(28) and (29) are the corresponding 
reduced-form supply functions. 


C. Equilibrium for the Toll Regimes 


From (28) and (31), for j=u,c,o, the 
trip supply function for the various toll 
regimes is 


2T/5N 
(33) pi(N,s) = 


and equations (33) and (17) together implic- 
itly give N’(s) and p(s), while (32), (33), 
and (17) give 7/(s). Equilibrium for a partic- 
ular toll regime is portrayed graphically in 
Figure 2. The diagram is standard. How- 
ever, the demand and cost curves are de- 
fined over the entire rush hour, and it makes 
no sense to define them over an interval of 
the rush hour. Thus, the standard graphical 
treatment of traffic congestion is consistent 
with the analysis of this paper when the 
rush hour is treated as a single period, but 
not when the rush hour is divided into time 
intervals. 
We also have, for j = u,c,0, 


(34) CS'(s)= N( p) dp 


p(s) 
and 
(35) SS'(s)=CS (s) + Ni(s)#(s). 
D. Comparison of the Four Regimes 
Together (28), (29), (31), and (17) imply 


(36) > > B°(s) = 6°(s) 
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p 


p (N,s) = MSC’(N,s) = 2ATC'(N;s) 


ATC’(N,s) 


| 
| 


N 


RE 2. EQUILIBRIUM WITH THE OPTIMAL TOLL 
UNDER TOLL REGIME j: 7/(s)=ATC (s) 


which, with variable demand, in turn im- 
plies 


(37) N"(s)<N*(s) <N°(s)=N(s). 


The explanation of these inequalities is 
straightforward. For a given number of trav- 
elers, the price of a trip is lowest for the 
no-toll equilibrium (since no toll is charged) 
and the fine-toll equilibrium (since travel on 
the road is efficient), intermediate for the 
coarse-toll equilibrium, and highest for the 
uniform-toll equilibrium. The coincidence 
of the no-toll and fine-toll supply functions 
reflects the particular congestion technology 
assumed.’ 

One can calculate the efficiency loss, rela- 
tive to the fine toll, of the other pricing 
regimes: 


(38) EL’(s)=SS (s)-SS (s) j=c,u,e. 


In particular, SDC®* = = SDC° and TTC? = 
0, which imply that ATC’ =2ATC®, so that p°= 
2ATC® = = p*. 


169 
\ 
| 
T (s) 
ATC y 
¥ N=N(p) 
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The efficiency loss in the no-toll equilibrium 
equals the toll revenue collected at the 
fine-toll equilibrium. The efficiency loss in 
the uniform-toll and coarse-toll equilibria 
equals the corresponding loss in consumers’ 
surplus plus the loss (possibly negative) of 
toll revenue. Since the no-toll situation is a 
special case of a uniform toll, which is in 
turn a special case of a coarse toll, it is 
evident that 


(39) EL (s)>EL (s)>EL (s)>0. 
III. Elastic Demand, Optimal Capacity 


Optimal capacity is that capacity which 
maximizes gross social surplus less capacity 
costs; it occurs where the marginal benefit 
from capacity expansion (the gain in social 
surplus) equals the marginal cost. Let K(s) 
be the capacity—construction-cost function. 
The marginal capacity-expansion cost is 


dK(s 
MC(s) = 


(40) 


A. The No-Toll Regime 


Since there is no toll, the marginal benefit 


is simply the gain in consumers’ surplus. 
From (21), 


Me 


B. The Toll Regimes (j = u,c,0) 


From (35), the marginal benefit from ca- 
pacity expansion equals the marginal con- 
sumers’ surplus plus the marginal toll rev- 
enue; that is, 


dCS'(s) 


(42) MB (s)= 7 
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From (34), 
j 
dCs (s at, 
dCs (s) = N'(s) ( ) 
ds d 


and from (32), 


ds 2 ds 


Thus, from the above, and using (28b) and 
(32), 


(43) MB’(s)=- 
as 
_ aN\(s) 
= Ni(s)#/(s) 
d#4(s) 
#i(s) ds 


_ Ni(s)#(s) 
Ss 
TC (s) 


which has the following interpretation: since 
the tolls are set at the optimal level condi- 
tional on the tolling regime (with the result 
that commuters pay the marginal social cost 
of a trip), the analysis is first-best.2 As a 
result, the envelope theorem holds, which in 
this context implies that the marginal bene- 
fit of an incremental capacity expansion is 
the same before and after individuals adjust 


‘There is a difficulty of terminology here. There are 
two qualitatively different sources of inefficiency possi- 
ble: the /evel (or absence) and the form of the toll. We 
refer to a situation where travelers face the marginal 
social cost of a trip as first-best, even when the form of 
the toll is not optimal. 
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trip frequency in response to the capacity 
expansion. Since total travel cost with N 
fixed is proportional to the inverse of s, the 
marginal benefit of capacity expansion with 
trip frequency fixed is TC’ /s, which equals 
the marginal benefit of capacity expansion 
with trip frequency variable. 

Recall from (36) that p"(s)> p‘(s)> 
p(s). With elasticity of demand (e) less 
than unity, it follows that N“(s)p"(s)> 
N“(s)p°(s) > N°(s)p°(s), which in turn im- 
plies from (43) and (31) that 


(44) MB (s)>MB (s) >MB (s) 


for «<1. 


We assume that for each regime the 
marginal-cost curve cuts the marginal- 
benefit curve from below. Then optimal ca- 
pacity is determined by setting marginal 
benefit equal to marginal cost, and from 
(40) and (44) 

(45) for e <1 


> 
where “*” subscripts denote “at optimal 
capacity.” Thus, when « < 1, optimal capac- 
ity is larger the less sensitive the tolling 
regime. The less sensitive the tolling regime, 
the higher is average travel cost given ca- 
pacity. Furthermore, since price equals twice 
average travel cost for each toll regime, with 
€ <1 total travel cost is higher the coarser is 
the tolling regime given capacity. Finally, 
the marginal benefit from capacity expan- 
sion, for each s, is directly proportional to 
total travel cost. Hence, for each level of 
capacity, the marginal benefit from capacity 
expansion is higher, the less sensitive the 
tolling regime. The same line of argument 
implies that when «> 1, optimal capacity is 
smaller, the coarser the toll regime. 

From (40) and (43) and using p= 
2ATCi, = 2TCi, / Ni, 


_ aK(s) 
Ni pi, = 254 
ds 


Thus, as long as the elasticity of marginal 
construction costs with respect to capacity is 
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greater than —1, a condition we assume to 
hold, 
(46) 


DS > Nope Nop for <1. 


This implies that, for « <1, 


(47) D's > D> DY 


and, with «> 0, 
< 


The cases of ¢=1 and e>1 can be found 
in Arnott et al. (1987).° 


C. Comparison of No-Toll and Toll Regimes 


It remains to compare the no-toll equilib- 
rium with optimal capacity to the various 
toll equilibria with optimal capacity. Unfor- 
tunately, the results are not as clean as for 
the comparison of the toll equilibria with 
optimal capacity since, with the price of a 
trip in the no-toll equilibrium below 
marginal social cost, the determination of 
optimal capacity is a second-best problem. 
Equation (43) does not apply in the no-toll 
equilibrium because the envelope theorem 
does not hold (the marginal benefit from 
capacity expansion in the no-toll equilib- 
rium includes the change in the efficiency 
loss due to underpriced congestion caused 
by the induced increase in trip frequency). 
However, some results can be obtained, and 
these are presented in Arnott et al. (1987). 

The empirical evidence strongly supports 
the hypothesis that rush-hour car travel, 
which is predominantly commuting, is 
price-inelastic (e.g., Daniel McFadden, 
1974: Small, 1983; John Pucher and Jerome 
Rothenberg, 1976). Combining this stylized 
fact with (45), we have the prediction that 
optimal capacity falls as the “sensitivity of 
the tolling regime” increases. The model 
also predicts that, with constant elasticity of 


“It is interesting to note that, with optimal capacity 
and constant unit construction cost, MB’(s)=TC’/s 
= k = MC(s), which implies TC4, = ks/,; that is, capac- 
ity construction cost equals total travel cost. This result 
is due to the form of congestion. 


172 THE AMERICAN ECONOMIC REVIEW 


demand less than 1, optimal capacity in the 
no-toll regime is higher than under all the 
tolling regimes (see Arnott et al., 1987). 

It is worth restating that there are two 
separate issues involved in comparing opti- 
mal capacity between regimes. The first is 
whether “second-best capacity” (optimal ca- 
pacity conditional on congestion being un- 
derpriced) is less than or greater than 
first-best capacity (where congestion is ef- 
ficiently priced). This is a classic problem 
that has been well researched in the context 
of the standard model of traffic congestion 
(e.g., William C. Wheaton, 1978; John D. 
Wilson, 1983; Edmond L. d’Ouville and 
John F. McDonald, 1990). Broadly speak- 
ing, our results are consistent with these 
papers, though not fully comparable be- 
cause their treatment of congestion is static. 
The second issue entails the comparison of 
first-best toll regimes that differ in their 
sensitivity. Here our analysis is new, and the 
results are clean. 

For future reference, we note that with 
optimal capacity the efficiency losses associ- 
ated with the coarse-, uniform-, and no-toll 
equilibria compared to the optimal fine-toll 
equilibrium are 


dp + ~— 
Py 
+ ) j=e 
(48) ELi= 
Ni 
[?*N(p) dp + * Dx 
PS 2 2 


— K(s8)+ K(si,) j=u,e. 


The results of the previous two sections 
are brought together in the following propo- 
sition. 


PROPOSITION 2: 


A 


(i) > p(s) > p(s) = p(s); 
(ii) EL’, > 
EL", > 
(iii) with price-sensitive trip demand, N“(s)< 
N“(s)<N%s)= Ns); 
(iv) with <1, si 
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(v) with e <1 and [d?K(s)/ds*]s + 
dK(s)/ds>0, p. > pS > pS, and (if 
e>O0) NE<NE<N2. 


IV. The Self-Financing of Capacity 


Mohring and Harwitz (1962) and Strotz 
(1965) investigated the extent to which a 
network of roads should be self-financing. 
In the single-period version of their analy- 
sis, the private cost (C) is a function of 
capacity (s) and the number of trips (N). 
The planner chooses price and capacity so 
as to maximize social surplus [consumers’ 
surplus plus toll revenue, R=(p—C)N] 
minus capacity costs, K(s); that is, 


(49) max dp’ 
“p 


K(s). 


The corresponding first-order conditions for 
p and s are: 


ON d dp 
and 
(50b) N ) 
-~—N-—=( 
Os ds 


respectively. Equation (50a) reduces to 
(50a’ ) 


Equation (50a’) states that price should 
be set equal to marginal social cost, the sum 
of the private cost (C) and the marginal 
congestion externality ([@C/@NJN), and 
(50b) states that capacity should be con- 
structed up to the point where marginal 
cost (dK /ds) equals marginal benefit 


(—[dC /ds]N). Assume to simplify that C(-) 
is homogeneous of degree A* in N and s 
and that K(-) is homogeneous of degree h* 
in s. Then, using Euler’s Theorem, (50b) 


| 
) 
dC dN dN 
= 
| 


VOL. 83 NO. 1 


may be rewritten as 


aC 
(51) = + NRCC. 
ON 
The optimal toll, +, should be set equal 
to the marginal congestion externality, 
(0C /ON )N. Thus, 
(52) R= Nr=h*K + 
where R is toll revenue. Equation (52) im- 
plies that, if there are constant costs to 
capacity expansion so that h“ =1 and if a 
doubling of capacity and the number of 
trips leaves the cost of a trip unchanged so 
that h° = 0, then R = K;; that is, optimal toll 
revenue exactly covers the cost of construct- 
ing optimal capacity. One can extend the 
analysis to compute the proportion of the 
cost of capacity construction that is financed 
from toll revenue when either the toll or the 
level of capacity is nonoptimal. 

The issue of interest here is the form of 
the self-financing results in the bottleneck 
model. Obviously, in the no-toll regime, no 
toll revenue is collected. We therefore re- 
strict our attention to the uniform, coarse, 
and fine tolls. The analog to (49) is 


(53) max p’) dp’ 


N(p’) 


— j=u,Cc,O. 
It is evident that (53) is simply a particular- 
ization of (49) and, hence, that the MHS 
self-financing results hold for our model. 
Since ATC(-) is homogeneous of degree 
zero in N and s for each tolling regime 
(recall Proposition 1), equation (52) reduces 
to 

(54) Ri, j= u,c,o. 
Independent of the tolling regime, the ratio 
of the revenue collected from the optimal 
toll to the construction cost of optimal ca- 
pacity equals the elasticity of construction 
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cost with respect to capacity. This is re- 
markable since it indicates that the optimal 
degree of self-financing of a road is indepen- 
dent of the form of the pricing system em- 
ployed; for example, if a road system should 
be self-financing when a sophisticated tolling 
system is employed, it should also be self- 
financing when only a flat parking fee is 
applied. 

The major result of this section is summa- 
rized in the following proposition. 


PROPOSITION 3: The ratio of the revenue 
collected from the optimal toll to the costs of 
constructing optimal capacity equals the elas- 
ticity of construction cost with respect to ca- 
pacity, whatever the tolling regime. With con- 
stant costs of capacity, the road should be 
self-financing; with increasing costs, it should 
generate a surplus; and with decreasing costs, 
it should operate at a loss. 


Vv. A Numerical Example 


We start by considering the situation in 
which capacity is fixed. Subsequently, we 
extend the example to treat optimal capac- 
ity. 


A. Capacity Fixed 


We normalize so that N° =1, and then 
set s = 0.4 cars/hr so that the length of the 
rush hour in the no-toll equilibrium, ty — tg 
= N©/s=2.5 hours. Thus, p* = 6(N*/s) 
[from (20)]=$2.5(6). Consistent with col- 
umn 1 of table 2 in Small (1982), we take 
a = $5.00/hr, B = $3.05/hr, and y= 
$11.88/hr.'° Hence, 5 = By/(B + y) = 
$2.425/hr, p* = $6.063, and = 0.7292 


“Small (1982) estimated the ratios B/a and y/a. 
To construct an estimate of a we draw on Small (1991 
pp. 2-54), who concludes that “...a reasonable aver- 
age value of time for journey to work is 50 percent of 
the gross wage rate....”’ Bureau of Labor Statistics 
(1991) reports average 1990 hourly earnings of $10.03. 
Half this is $5.00/hr in round figures, which we take as 
our value of a. Since the Bureau of Labor Statistics 
figure excludes supervisory and government workers, it 
understates the average wage for the whole population 
and thus leads to a conservative value for @ as con- 
structed. 
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TABLE 1—NUMBER OF COMMUTERS, CosT OF TRIP, 
AND EFFiciENCY Loss wiTH VARIOUS TOLLING 
REGIMES WHEN Roap Capacity IS FIXED 
(N° =1, s=0.4 vehicles /hr, = $5.00/hr, 

B = $3.05 /hr, y = $11.88 /hr) 


Tolling regime 


Variable € e u c oO 
0.2 1 0.8909 0.9390 1 
0.7071 0.8280 1 
pi 0 6.063 12.125 8842 6.063 
0.2 6.063 10.802 8.304 6.063 
l 6.063 8574 7.322 6.063 
| 
EL 0 3.031 3.031 1.390 
0.2 3.031 2.671 1.302 0 
1 3.031 2101 1.144 0 
0 0.458 0 
0.2 1 0.881 0.429 0 
EL 1 0.693 0.377 0 


[from (29c)]. Finally, we assume a constant- 
elasticity demand function, N = np‘. Then 
n= =1 for = 0, 1.434 for = 0.2, 
and 6.063 for « =1. Throughout the exam- 
ple, we treat three demand elasticities: (i) 
e=0, the extreme of completely inelastic 
trip demand; (ii) e = 1, the highest possible 
reasonable value; and (iii) ¢ = 0.2, our best 
guess on the basis of McFadden (1974), 
Pucher and Rothenberg (1976), and Small 
(1983).!! 

From formulas in Arnott et al. (1987), we 
obtain the results given in Table 1. Recall 
that the average toll equals one-half the 
price in the three tolling regimes. Several 
points are worthy of note. First, observe 
that the uniform toll tends to be more effi- 


cient (i.e., EL" /EL° is lower) the higher the 


"Our model is sufficiently stylized that it is not 
clear what elasticity should be employed, though it 
should be long-run. Note also that since we set fixed 
travel time to zero, e is the elasticity of trips with 
respect to the variable component of trip price. The 
empirical literature, on the other hand, measures the 
elasticity of trips with respect to the total trip price. 
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elasticity of demand. This is as expected. 
There are two sources of efficiency loss from 
imposing no toll: first, because travel is 
underpriced, too many trips are taken; sec- 
ond, given the number of trips taken, cars 
do not distribute themselves efficiently over 
the rush hour. The uniform toll affects only 
the first source of efficiency loss, which is 
relatively more important the more elastic is 
trip demand. Second, with our best-guess 
demand elasticity of 0.2, more than half the 
efficiency gain from applying the fine toll 
can be achieved by applying the coarse toll. 
Third, the efficiency gain from applying con- 
gestion tolls can be substantial: relative to 
the no-toll situation, $3.03 per trip can be 
saved by applying the optimal fine toll. As 
argued in Arnott et al. (1990), this suggests 
that the potential gain from applying urban 
auto tolls may have been considerably 
underestimated. Fourth, with our best-guess 
demand elasticity, the efficiency gain from 
applying the optimal coarse toll is almost 
five times as high as that from applying the 
optimal uniform toll. Thus, the efficiency 
gain from the rescheduling of trips induced 
by an urban auto toll is likely to be larger 
than the efficiency gain from the reduction 
in trips caused by the toll. This in turn 
strengthens the argument for more serious 
investigation of technologically advanced 
tolling schemes. 


B. Capacity Optimal 


We assume that there are constant costs 
to capacity expansion [K(s)= ks] and that 
construction costs are such that the length 
of the rush hour is 2.5 hours when road 
width is chosen optimally for the no-toll 
regime. Using formulas in Arnott et al. 
(1987), we find k =15.157 for «=0, k= 
12.630 for 0.2, and k = 7.578 for = 1, 
and we obtain the results given in Table 2. 
They are qualitatively much the same as 
those presented in Table 1 and are consis- 
tent with the results given in Proposition 2. 
It is worth remarking, however, that with 
our best-guess demand elasticity, optimal 
capacity with the fine toll is only about 75 
percent of that with no toll. 
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TABLE 2—NUMBER OF COMMUTERS, PRICE OF TRIP, 
AND EFFICIENCY Loss wiTH VARIOUS TOLLING 
REGIMES WHEN Roap Capacity Is SET OPTIMALLY 
(NE =1, s§. = 0.4 vehicles /hr, a = $5.00/hr, 

B = $3.05 /hr, y = $11.88 /hr) 


Tolling regime 


Variable u c 


| 1 
0.8866 0.9150 0.9502 
0.7071 0.8281 


12.125 10.355 
11.069 9.452 
8.574 7.321 


3.551 1.780 
2.970 1.515 
2.101 1.144 


1 0.501 
0.891 0.454 
0.693 0.377 


0.4000 
.4000 
0.4000 


0.4000 
0.3885 
0.4000 


0.3416 
0.3424 
0.4000 


0.2828 
0.2944 
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VI. Extensions 
A. Heterogeneous Commuters 


The extension to treat commuter het- 
erogeneity is conceptually (though not 
algebraically) straightforward; indeed, the 
extension has already been done for the 
optimal fine-toll and no-toll equilibria with 
inelastic demand by Gordon F. Newell 
(1987), Cohen (1987), and Arnott et al. 
(1989), who consider heterogeneity in a, B, 
y, and t*. The essential insight is that in 
equilibrium user types follow a specific de- 
parture order; for example, “assembly-line 
workers” (those with a high y) depart so as 
to arrive early. Consequently, one cannot 
simply apply the analysis of the paper to an 
average user. 

With heterogeneous commuters, the re- 
sults are not as neat. The simple reduced- 
form diagram does not extend; the relation- 
ship between aggregates is no longer as 
simple; and the characteristics of the vari- 
ous toll equilibria, as well as the generaliz- 
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ability of the MHS self-financing results, 
may depend on whether the congestion toll 
is anonymous. 

While user heterogeneity undermines the 
simplicity of the resuits presented in the 
paper, it reinforces the basic message. In 
order to model user heterogeneity correctly, 
it is necessary to model different groups’ 
departure-time decisions and, hence, to 
adopt a structural approach. 


B. Other Congestible Facilities 


We cast our analysis in terms of a very 
specific congestible facility: a bottleneck on 
a point-input, point-output road in the 
morning rush hour. An obvious question is 
to what extent our results generalize to other 
congestible facilities. The short answer is 
that there are advantages to modeling all 
congestible facilities structurally and explic- 
itly treating users’ decisions and the conges- 
tion technology, but the details of the analy- 
sis will differ from one congestible facility to 
the next. We have seen, for a specific case, 
that the standard model is poorly specified. 
Structural modeling imposes the discipline 
to ensure proper specification. The stan- 
dard procedure of dividing the period of use 
into intervals and solving separately for 
equilibrium in each interval may, however, 
provide a good approximation for some con- 
gestible facilities. 

The nature of users’ decisions will differ 
according to the congestible facility. For all 
congestible facilities, time of use is an im- 
portant margin of user choice. Other mar- 
gins may be important too: with flow con- 
gestion in transportation, the cost functions 
depend on the vehicle headway drivers 
choose (Julio J. Rotemberg, 1985); a realis- 
tic model of parking would treat the deci- 
sion concerning length of time parked 
(Amihai Glazer and Esko Niskanen, 1990); 
in telephone traffic, the user decides on call 
duration; in the use of public facilities, such 
aS a swimming pool or zoo, visit length is 
important; in an art gallery, the extent of 
congestion depends on the distance viewers 
stand from pictures; and in wilderness ar- 
eas, the cost functions depend on hikers’ 
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choice of path and speed of travel 
(Mordechai Shechter and Robert C. Lucas, 
1978). 

The nature of the congestion technology 
also depends on the facility, and for most 
facilities there are several congestion-prone 
elements of capacity. For example, in going 
to a baseball game, a spectator will cruise 
for a parking spot, walk from the parking 
spot to the entrance, queue for a ticket, 
experience crowding while watching the 
game, and then after the game join a car 
queue to exit the parking lot. The extent to 
which congestion is dynamic (i.e., to which 
cumulative usage effects are important) also 
varies across facility types. 

It remains to be seen how the details of 
the analysis differ according to the type of 
congestible facility. An analysis similar to 
that of this paper has been undertaken in 
de Palma and Arnott (1990), but for tele- 
phone congestion on a single line. In that 
study, modeling of telephone congestion is 
the same as that for the road bottleneck, 
except that (i) the queue discipline is such 
that customers in the queue are served on a 
random basis, while on the road they are 
served on a first-come first-served basis, and 
(ii) telephone capacity is a stock (the num- 
ber of users that can be talking on a line at 
a point in time) while road capacity is a flow. 
The reduced-form formulas obtained were 
very similar to those in the present study. 
This demonstrates that the approach taken 
in the paper can be adapted to other con- 
gestible facilities. However, the extension to 
other congestible facilities for which there 
are multiple elements of capacity or for 
which congestion takes a form other than 
queueing will not be trivial. 

We now comment on the extent to which 
the results recorded in the propositions gen- 
eralize. Proposition 1 recorded the conges- 
tion cost function in our model under vari- 
ous pricing regimes. The results are specific 
and do not generalize. Proposition 2 com- 
pared the various tolling equilibria. The re- 
sult that the efficiency loss is larger the less 
sensitive the tolling regime is obviously gen- 
eral. The other major result concerned opti- 
mal capacity: with «<1 (>1) optimal ca- 
pacity was larger (smaller) the less sensitive 
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the tolling regime. These results generalize 
as follows. Since the tolls are set optimally 
conditional on the tolling regime, with 
identical individuals, each user faces a price 
equal to marginal social cost, and the enve- 
lope theorem holds. Consequently, the 
marginal benefit from capacity expansion is 
—dTC!/ds, and as long as marginal cost 
cuts marginal benefit from below, optimal 
capacity is larger under regime j’ than un- 
der regime j if —dTC’ /ds > —0TC’ /ds 
for all s. With «<1, —dTC’/ds should 
normally be larger the less sensitive the 
tolling regime, but the precise conditions 
will depend on the facility. Proposition 3 
related the self-financing results. Equation 
(52) generalizes, provided each user faces a 
trip price equal to marginal social cost. 

Our analysis suggests that previous stud- 
ies have significantly underestimated the 
gains from the sophisticated tolling of urban 
roads. Those studies were based on models 
that ignored both cross-price effects and the 
dynamic nature of congestion and, hence, 
failed to capture the change in the distribu- 
tion of departure times induced by tolls. 
Studies of other congestible facilities have 
generally treated cross-price effects but not 
the dynamic nature of congestion. Thus, it 
is unclear whether structural models of those 
facilities would uncover significant previ- 
ously unperceived gains from more sophisti- 
cated pricing policies.'* 


'2 Compared to roads, the pricing of most other 
congestible facilities (e.g., telephone, computer, electri- 
cal, and water networks) is relatively sophisticated. 
Time-dependent pricing is the rule for long-distance 
telephone traffic and is not uncommon for electrical 
power. Priority pricing (whereby jobs with a higher 
priority move up the queue more quickly) is the rule on 
mainframe computers. Usage-dependent pricing is em- 
ployed in some contexts; the University of Cambridge 
employs it for computer usage, and Hydro Québec 
employs it in some of its electrical pricing. Other forms 
of peak-load pricing include priority-service pricing 
(Robert Wilson, 1989) and demand-layer pricing (Ro- 
bert S. Main, 1973). In all contexts, however, more 
sophisticated pricing is possible. For example, long-dis- 
tance telephone rates could presumably be made very 
sensitive to the level of usage: a customer would dial a 
long-distance number, the efficient tariff based on cur- 
rent system usage would be displayed, and she would 
then decide whether to proceed with the call. 
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VII. Conclusion 


In the Introduction we argued that the 
standard model of peak-period traffic con- 
gestion is poorly specified because it fails to 
model commuters’ departure-time decisions 
and the congestion technology. To our 
knowledge, the first paper in the literature 
that addressed these deficiencies was Vick- 
rey (1969). He constructed a model of the 
morning commute on a road with a single 
bottleneck in which a fixed number of iden- 
tical individuals wish to arrive downtown at 
the same time and choose when to depart, 
trading off schedule delay against travel 
time. The queue length adjusts until, in 
equilibrium, trip price is uniform over the 
peak period. The solution to the model pro- 
vides a function relating trip price and 
marginal social cost to capacity, the number 
of users over the entire rush hour, and the 
form of pricing. This yields the correctly 
specified reduced-form supply and cost 
functions. The trip demand functions, too, 
should be specified over the entire rush 
hour. Thus, with identical commuters, the 
version of the standard model which treats 
the period of use as a single interval is 
sound. However, the procedure employed 
in much of the literature, of dividing up the 
period of use into intervals and solving sep- 
arately for equilibrium in each interval, is 
logically flawed. 

In this paper, we examined some of the 
economic implications of the Vickrey model 
and also extended it to treat elastic demand 
and optimal capacity under a variety of pric- 
ing regimes. Two particular findings merit 
emphasis. First, with identical individuals, 
the MHS results on the degree of self- 
financing of congestible facilities with opti- 
mal capacity and optimal tolls were shown 
to apply regardless of the tolling regime. 
Hence, if a road of optimal capacity should 
be self-financing with an optimal time- 
varying toll, it should also be self-financing 
when only a gasoline tax (set at the optimal 
level) can be employed. Second, we com- 
puted the gains from efficient pricing to be 
considerably greater than those given in the 
empirical literature on urban auto conges- 
tion. The reason is that previous empirical 
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estimates are based on a model that ignores 
the efficiency gain that results from a toll’s 
ability to redistribute travelers over the rush 
hour. This suggests that more serious con- 
sideration should be given to sophisticated 
tolling schemes on urban roads. 

While the analysis focused on rush-hour 
auto congestion, we discussed the general- 
ization of our approach to other congestible 
facilities. We argued that there are always 
advantages to providing a structural model 
of a congestible facility, which explicitly 
treats users’ decisions and the facility’s con- 
gestion technology. 

The appropriate directions for future re- 
search are evident. Vickrey’s (1969) paper, 
supplemented by this one, provides the 
method for determining structural models 
of facilities subject to peak-load congestion. 
However, the modeling of the congestion 
technology is primitive. What we need now 
are more realistic models of congestion 
technologies and of consumers’ behavioral 
decisions, along with empirical estimation 
of them. In the context of rush-hour traffic 
congestion, for example, models should be 
developed which derive hypercongestion 
(traffic-jam situations) from driving behav- 
ior, solve for equilibrium on a congested 
network, and account for heterogeneity 
among users in addition to accident and 
road damage costs. As well, it is clearly 
desirable to apply Vickrey’s (1969) ap- 
proach to other types of congestible facili- 
ties, notably computers, public utilities, air- 
ports, telecommunications, and recreational 
facilities. 
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Entry, Dumping, and Shakeout 


By RicHarp H. CLaripa* 


This paper studies the process of entry, dumping, and shakeout and _ the 
properties of the free-trade equilibrium that results in a world in which countries 
differ in their stock of technological knowledge, but not necessarily in their stock 
of technological capability. The model can account for the dumping of products 
by newly industrializing countries and technological newcomers at a price that 
fails to cover average cost in the face of a surge in world demand for the 
dumped product, a phenomenon that has not been uncommon in recent years. 


“{Between 1981 and 1986] the number 
of [dumping] petitions resulting in in- 
terventions...increased 258% [relative 
to the 1975-1980 period]. It is difficult 
to believe...that dumping has risen to 
more than two and one-half times its 
previous level especially since the 
strong dollar during the latter period 
made selling in the United States eas- 
ier for foreigners rather than harder.” 
[Economic Report of the President, 

1988 p. 159] 


Over the past decade, there has been an 
explosion in the number of dumping cases 
brought by U.S. producers against foreign 
exporters. The U.S. trade law defines dump- 
ing to occur when imports are sold in the 
United States at a price that is less than the 
home-country price (the traditional defini- 
tion of dumping) or when the import price 
is less than the “constructed value” or aver- 
age cost of production of the imported 
good (since a 1974 change in the law). In 
fact, in recent years, formal dumping com- 


*Columbia University and the National Bureau of 
Economic Research. This is a substantially revised 
version of an earlier paper entitled, ‘Destructive Com- 
petition, Dumping, and Learning by Doing.” | thank 
Ron Findlay, Jagdish Bhagwati, Ricardo Cabellero, 
Maurice Obstfeld, Avinash Dixit, Arie! Pakes, two ref- 
erees, and seminar participants at Columbia, Yale, 
Pennsylvania, Chicago, the NBER Summer Institute, 
UCLA, and New York University for their comments 
and suggestions, many of which have found their way 
into this paper. I am responsible for all remaining 
errors. 


plaints have increasingly often been based 
upon allegations of “pricing below cost” 
and not on allegations of price discrimina- 
tion in favor of U.S. consumers (Howard 
Gruenspecht, 1988).! 

In policy discussions, dumping is often 
attributed to a situation in which too many 
producers are chasing too few consumers.” 
In a cyclical downturn, considerations such 
as exit costs and implicit labor contracts 
may very well make it optimal and ex ante 
rational for existing firms to continue to 
produce at a price that falls short of average 
total cost.? Although the severity of the 
1981-1982 worldwide recession no doubt 
can account for some of the surge in dump- 
ing findings during this decade, the fact that 


'This is not to say that international price discrimi- 
nation on the part of exporters with market power, 
Jacob Viner’s (1923) definition of dumping, is uncom- 
mon or unworthy of study. Rather, formal dumping 
complaints, at least in the United States, are increas- 
ingly often based upon allegations of “pricing below 
cost” regardless of transactions prices in home markets 
(Gruenspecht, 1988). Furthermore, “in testing for the 
presence of dumping, the foreign transactions price 
cannot be used for comparison with the American sales 
price if the former is less than the cost of 
production...[which] must include general expenses 
equal to at least 10 percent of manufacturing costs plus 
a profit margin of at least 8 percent” (Wilfred Ethier, 
1982). 

For a superb account of the political economy of 
dumping, see Jagdish Bhagwati (1986). 

“This was first shown rigorously by Ethier (1982), 
although the idea of cyclical dumping dates back to at 
least Viner (1923). In the conclusion, I briefly discuss 
the recent literature on strategic-duopoly models of 
dumping. 
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the number of preliminary 
dumping by the United States Department 
of Commerce approached or exceeded all- 
time highs in the later years of the 
1982-1990 economic expansion indicates 
that cyclical factors, as they are normally 
understood, cannot provide a complete ex- 
planation of dumping during the 1980's 
( Annual Report on the Operation of the Trade 
Agreement Program, various issues). 

In this paper, I bring together the Ricar- 
dian model of international trade with a 
simple version of Boyan Jovanovic’s (1982) 
model of entry, selection, and the evolution 
of industry to study one mechanism by which 
dumping can occur. The model developed 
in this paper can account for dumping cases 
in which foreign producers sell at a world 
price that fails to cover average cost when 
there is a surge in demand for the dumped 
product and a rise in total production of the 
dumped product in the dumping country. 
As I document in Section V, the conjunc- 
tion of dumping at less than constructed 
value with a rise in the production and 
demand for the dumped product has not 
been an uncommon occurrence in recent 
years. It is, however, difficult to reconcile 
with traditional explanations of cyclical 
dumping, explanations that in fact account 
for dumping at less than average cost in the 
face of a fall in production and demand for 
the dumped product. 

The novel feature of the two-sector, two- 
country Ricardian model developed in this 
paper is that, while entry is endogenous, 
firms can only learn about their production 
function for producing one of the goods by 
actually producing that good (as in Jo- 
vanovic [1982]). Furthermore, production 
functions, as indexed by a Ricardian labor 
productivity parameter, differ across firms. 
However, because producers are ex ante 
identical, all producers share a common en- 
try decision rule based upon their knowl- 
edge of the population distribution of pro- 
ductivities across firms, the terms of trade, 
and their rational expectation of the future 
terms of trade. In turn, the equilibrium 
terms of trade depend upon the entry deci- 
sions made by firms, so that in a rational- 
expectations equilibrium, entry decisions 


findings of 


CLARIDA: ENTRY, DUMPING, AND SHAKEOUT 18] 


and the terms of trade are mutually consis- 
tent functions of the parameters that de- 
scribe technology and demand. 
Endogenizing entry substantially enriches 
the conditions under which dumping can 
occur under perfect competition and ratio- 
nal expectations. Intuitively, dumping can 
arise because the present value of entry 
includes the value of information about fu- 
ture profitability. Such knowledge is valu- 
able because the most productive producers 
earn positive current and future rents in 
equilibrium. Competition among ex ante 
identical producers for these present and 
future rents can drive down the price below 
the opportunity cost of the ex post efficient 
and inefficient firms who enter in search of 
the rents that accrue to the most efficient 
among them. I model a world of two coun- 
tries that differ only in their initial stock of 
knowledge about the technology for produc- 
ing one of the goods: the countries do not 
differ in the availability or distribution of 
this technology among firms, nor do they 
differ in size or tastes. Nonetheless, I show 
that if world demand is sufficiently high, 
dumping can occur in equilibrium. Interest- 
ingly, I also show that if world demand is 
sufficiently low, dumping cannot occur. 
High demand induces entry of firms seek- 
ing to capture the rents accruing to the 
most efficient (ex post) among them. Entry 
among ex ante identical firms pushes down 
the world price below the opportunity cost 
of new entrants, and these firms will in fact 
initially export in equilibrium. Ex post 
high-cost producers recognize their losses 
and exit. The rents that accrue to the re- 
maining producers in the dumping country 
exactly equal the net losses incurred as a 
result of dumping. Despite the fact that 
high world demand induces entry to the 
point that price is driven below private op- 
portunity cost, which is the output per en- 
trant forgone by (or average cost of entry 
to) the economy, I show that welfare in both 
the dumping (exporting) country and the 
importing country improve in the free-trade 
dumping equilibrium relative to autarky. 
The plan of the paper is as follows. In 
Section I, I set up the model, and Section II 
derives the optimal entry decision rule. In 
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Section III, I specialize the model and ob- 
tain a closed-form expression for the au- 
tarky ex ante entry price in terms of the 
parameters of demand and technology. In 
Section IV, I provide explicit sufficient con- 
ditions for the existence of a dumping equi- 
librium in a two-country world in which the 
countries differ only in their initial stock of 
technological knowledge, not their techno- 
logical capability, their size, or their prefer- 
ences. In this equilibrium, positive trade 
flows arise from the (intertemporal) 
profit-maximizing efforts of firms in one of 
the two countries to accept a price below 
opportunity cost in exchange for the option 
of earning positive rents that accrue to the 
ex post most efficient among them. I also 
show that, while dumping coincides with 
excessive entry (excessive in that new en- 
trants on average suffer losses), the entry 
that does occur maximizes social surplus 
subject to the constraint that productivity 
must be learned via production. Section V 
explores some of the implications of the 
analysis and surveys some recent dumping 
cases in which technology and a surge in 
world demand for the dumped product ap- 
peared to play a prominent role. Section VI 
provides concluding remarks and relates the 
results to those obtained elsewhere in the 
literature. 


I. Setup of the Model 


I consider a two-good, two-period, two- 
country Ricardo-Jovanovic world. Each 
country is populated by N+M en- 
trepreneurs. Each entrepreneur supplies in- 
elastically one unit of labor, the sole factor 
of production. The two goods, z and y, are 
produced by entrepreneurs with constant- 
returns-to-scale production functions. I shall 
refer to entrepreneurs and firms inter- 
changeably. In both countries, all N+M 
entrepreneurs are endowed with one unit of 
labor and a technology that can be used to 
produce w units of y. N entrepreneurs in 
each country are endowed not only with one 
unit of labor and a technology for producing 
y, but also with a technology that can be 
used to produce z. Either w units of y or é' 
units of z, but not both, can be produced in 
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a given period by each potential entrant, 
i=1,...,N, to the z sector;* é' differs 
among firms and is distributed according 
G(é), which is common across both coun- 
tries. Each entrepreneur knows this distri- 
bution but cannot learn the value of his 
particular € without first producing z. 

I rule out borrowing and lending so that 
consumption must be financed by the value 
of current output. Let p (p’) denote the 
present (future) price of z in terms of y; p 
and p’ are endogenous in equilibrium but 
are taken as given by firms. Finally, I as- 
sume that preferences are time-separable 
and Cobb-Douglas. Let s denote the share 
of spending that falls on z by the N en- 
trepreneurs that can produce z or y in each 
country, and let a@ denote the share of 
spending that falls on z by the M en- 
trepreneurs that can only produce y in each 
country.> 


II. The Entry Decision Rule 


To derive the optimal entry decision rule 
followed by the N potential entrants to 
the z sector, begin by observing that the 
static utility gained by producing z in the 
present is s°(1— s)'~*( p'~*)é! while the sta- 
tic utility obtained by entering sector y is 
s*(1—s)'~*(p~)w. Entry into sector z will 
maximize expected current utility whenever 


(1) 


Otherwise, static utility is maximized by en- 
tering sector y. Of course, to the extent that 
the entry decision in the present impacts 
upon expected utility in the future, (1) is not 
the optimal first-period entry decision rule. 
However, by substituting p’ for p, (1) does 
describe the optimal second-period decision 
rule for firms that did not enter sector z in 


*Production of z or y by agent 7 in the present in 
no way changes the parameters of the of technologies 
available for producing either z or y in the future (i.e., 
&€'=&" and w =w’ in the future. 

Letting primes denote future values, entrepreneurs 
who can produce x or y maximize z°y!~* + zy! 
while those who produce y maximize z“y!~* + 


Zz 
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period 1. Rearranging, one obtains the fol- 
lowing proposition. 


PROPOSITION 1: For all firms that pro- 
duce y in period 1, entry to sector z in period 
2 will occur if 


(2) p’>w/Eé' 


where w /E€' is the ex ante static break-even 
price of z, the price at which entrepreneurs 
achieve the same expected utility in either 
sector z or sector y. 


What is the expected second-period util- 
ity of an entrepreneur who entered sector 
z in period 1? Such an entrepreneur 
learns his productivity in producing z. 
With this knowledge he can attain utility 
*)w if he switches to sector 
y, and he can attain utility s*(1—s)'~* x 
(p'*)é! if he stays in sector z. Note that, 
as of the beginning of period 2, é' is not 
random, so that 


(3) s)'~*E max|( p’*yw| 


is the expectation in the present of the 
utility in the future of an entrepreneur who 
enters the z sector in the present. 

By contrast, it is easily seen from (1) and 
Proposition 1 that the expectation in the 
present of the utility in the future for an 
entrepreneur who enters the y sector in the 
present is given by 


(4) s\(1-s)'* 
x max| ( p’*)w]. 


The difference between (3) and (4) reflects 
the net expected value of information about 
the ability to produce z. Is this value posi- 
tive? Since max[-] is a convex function, by 
Jensen’s inequality, the answer is given as 
the following proposition. 


PROPOSITION 2: 


(5) Emax|(p*)é,( w] 


> 
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The nonnegative value of information may 
be understood as follows. When p’ > w / E& 
in period 2, knowledge of é' obtained as a 
result of early entry allows ex post low- 
productivity firms to avoid remaining in the 
z sector even though all first-period y- 
producers—including those of low produc- 
tivity—will by (2) maximize expected utility 
by entering the z sector at this price. When 
p'<w/Eé in period 2, knowledge of é' 
obtained as a result of early entry signals 
high-productivity firms to remain in the z 
sector even though no period-1 y-producer 
will enter the z sector at this price. Thus, 
early entry has value, an option value if you 
will, if it provides information that permits 
high-productivity (low-productivity) firms to 
take advantage of (avoid) producing z when 
the price is low (high) relative to the oppor- 
tunity cost of resources. 

Pulling together (1), (4), and (5), I am 
now in a position to obtain a general ex- 
pression for the entry price at which firms 
are ex ante indifferent between entering 
sector z and entering sector y in period 1. 
Letting b denote the static ex ante break- 
even price, 


(6) b=w/Eé' 


and letting e denote the (forward-looking) 
ex ante entry price, one has the following 
proposition. 


PROPOSITION 3: The rational, ex ante en- 
try price at which firms are indifferent be- 
tween entering the z sector and entering the y 
sector is given by 


(7) e=b—(1/E€')Emax| p’é',w] 

+ (1/E€')max| p'Eé',w]. 
Furthermore, e<b, with strict inequality 
whenever the net value of information is posi- 


tive. 


For future reference, note that (7) can be 
rewritten as 


(8) e=b-(1/E€é')Emax| p’é' w,0] 


+ (1/Eé')max| w,0] 
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I conclude this section by observing that 
the ex ante entry price is related to the 
variance or, more generally, the dispersion 
of the probability distribution of productivi- 
ties across agents. Indeed, a sufficiently large 
mean-preserving spread of G around Eé€' 
must drive the ex ante entry price below the 
static break-even price.® For completeness I 
state the following corollary. 


COROLLARY 1: Given p’, b, and EE' there 
exists a mean-preserving spread of G such 
that 


(9) E max| p'é' — w, 0] 
> max| p'Eé' w,0] 


(i.e., the value of learning about productivity 
is positive) and 
(10) e<b 


(i.e., the ex ante entry price is less than the 
static break-even price). 


III. Entry and Demand 


Proposition 3 provides an expression for 
the ex ante entry price in terms of the 
opportunity cost of resources, w, the mo- 
ments of the distribution of productivity 
across agents, G(é), and the expected fu- 
ture price of z, p’. In any interesting model 
of dumping, p’ must be endogenous. In the 
present model, entry decisions in period 1 
depend upon the expected future price of z. 
However, the future price of z depends 
upon the future supply of z, which depends 
upon entry decisions in period 1. In a 
rational-expectations equilibrium, second- 
period prices and first-period entry must be 
mutually consistent. 

Although my primary interest is in provid- 
ing an account of dumping in international 


°PROOF: If max[ p’Eé' — w,0]> 0, a positive prob- 
ability of a sufficiently low realization of €'=€ such 
that p’&<w must make Emax{[p’é'—w;0] greater 
than max{ p’Eé' — w,0)]. If max{[ p’Eé' — w,0]= 0, a pos- 
itive probability of a sufficiently high realization of &' 
must make E max{ p’é' — w,0] positive. 


MARCH 1993 


trade, I begin by solving the model for the 
autarky equilibrium in each country. Later, 
these results are used to derive the condi- 
tions under which dumping (the export of 
goods at a common world price below the 
average cost of production) can occur. 
Moreover, if the two countries are identical 
in all respects and none of the potential 
z-entrants in either country knows its firm- 
specific productivity in producing z, autarky 
will be the free-trade equilibrium. 

To solve for the rational-expectations 
equilibrium, it is useful to make several 
further assumptions. I assume that s = 0; 
that is, the share of spending by potential or 
actual z-producers that falls on z is zero. 
This assumption insulates the position of 
the demand curve for z from the entry and 
exit decisions made by the producers of z. 
Entry and exit will have a decisive influence 
on price, but only because these decisions 
shift the supply curve along a given demand 
curve.’ I shall also assume that there are 
three productivity parameters é' and that 
these are distributed among the N potential 
z-entrants in each country according to the 
following distribution G: 


(11) &=1: shared by (1—277)N agents 
=1+@: shared by agents 
€=1-—o: shared by agents 


with 0<@ <1 and 0<7 <3, so that 
(12) 


Var é = 27”. 


‘Jovanovic (1982) also assumes that the position of 
the demand curve is invariant to entry and exit deci- 
sions. Jovanovic’s model is much more general in that 
he allows for normally distributed shocks, Bayesian 
learning over time about the production function, and 
an infinite planning horizon. What I purchase with my 
simpler setup is the ability to characterize completely 
the two-country free-trade equilibrium in a Ricardian 
world in which countries differ in their initial stock of 
technological knowledge and to show how dumping— 
which is the export, not just the sale, of goods at a 
price below average cost—can occur. 
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The third assumption I shall make is that 
N and M are large enough so that the 
frequency distribution of productivities 
among firms that actually enter the z sector 
in period | is given by the population distri- 
bution.® It follows that the equilibrium p’ is 
a function of the number of firms that enter 
in period 1, the population distribution of 
productivities, and the parameters of the 
demand curve for z, which for the Cobb- 
Douglas preferences is given by 


d.(p)=Maw/p. 


It will be noted that, for the distribution in 
(11), the static break-even price is equal to 
the opportunity cost of resources w, b= w. 
These assumptions are sufficient to charac- 
terize completely both the autarky and 
free-trade equilibria. I begin with the fol- 
lowing corollary. 


COROLLARY 2: Under the above three as- 
sumptions, the ex ante entry price is 


(14) e=w-7(1-@) 


x max[w /(1— — p’, 0} 


so long as p’ => w. If p' < w, the ex ante entry 
price is given by” 


(15) 


e=w-7(1+¢) 
x max| p’ -—w/(1+¢),0]. 


It follows that the ex ante entry price is 
bounded by 


w(1-7¢) <e<w. 


Equation (14) says that early entry has 
value, so that e is driven below break-even 


‘This assumption is in no way essential to the paper’s 
main results, because all of these results go through in 
the special case in which demand for z at the ex ante 
entry price exceeds the maximum supply of z, N. Of 
course in this case, the distribution of productivities 
among entrants is equal to the population distribution. 

The proof of (14) and (15) just requires plugging 
the distribution (12) into equation (7) and canceling 
terms. 
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level w, if it provides information that per- 
mits low-productivity firms to avoid produc- 
ing z when the future price of z is high 
relative to the opportunity cost of resources. 
Equation (15) says that early entry has value, 
so that e is driven below break-even level 
w, if it provides information that permits 
high-productivity firms to take advantage of 
producing z when the future price of z is 
low relative to the opportunity cost of re- 
sources. In other words, early entry has 
value if it conveys knowledge that results in 
experienced firms making production deci- 
sions in the future that differ from the pro- 
duction decisions of new entrants in the 
future. 

For ease of exposition, I shall confine my 
investigation of the phenomenon of entry at 
a price below average cost (and later the 
phenomenon of dumping—the export of 
goods at a price that falls short of average 
cost) to equilibria in which the future price 
of z, p’, equals or exceeds the break-even 
level w. A sufficient condition for this to 
occur in the closed economy is simply the 
following:'” 


(16) (1-7)/m> 


Note that since 7 <4, the restriction that 
gy <5 will insure that p’ > w. 

Corollary 2 does not pin down the condi- 
tions under which the equilibrium price of z 
in period 1 is driven below the ex ante 
break-even price w. The following proposi- 
tions do. 


PROPOSITION 4: Given <3, w, 7, 
and M, a sufficient condition in the closed 
economy for the ex ante entry price to be 


Since is an upper 
bound on first-period demand for z, as well as an 
upper bound on the number of first-period entrants. A 
fraction 7 of these firms are high-productivity and are 
willing to remain in the z sector if w/(1+ @) < p’. The 
output produced by these firms is then aM7(1+ ¢)/ 
(1— 7). If second-period demand at the break-even 
price p’=w exceeds this supply, p’ >w. This is the 
condition that aMw/w>aMa(1+ ¢)/U—7¢) or 1 
— > 2g. Since 7 <4, p'>w if p<}. 
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driven below the break-even price, e <w, is 
the following:"' 


(17) aM(1-¢)/[1-7(1-¢)] <N. 


First-period entry has an option value if 
experienced firms can avoid losses in the 
future that inexperienced firms may incur. 
A fraction 7 of new entrants in the future 
will incur losses if p’ < w /(1— ¢). Equation 
(17) is just the condition that potential fu- 
ture supply of z be large enough relative to 
demand at the price w /(1— ¢) so that the 
equilibrium future price falls short of this 
level, p’ <w/(1— ). Thus, first-period en- 
try has a value in that it permits ex post 
low-productivity firms to avoid producing z 
when the price of z is high relative to 
opportunity cost w. 

I now provide sufficient conditions for the 
equilibrium period-1 price to be driven be- 
low opportunity cost. This means that a 
fraction 1—7>4 of the firms that enter 
suffer losses in period 1 and that these 
losses exceed the rents earned by the 
ex post most efficient producers. 


PROPOSITION 5: There exists a_ great 
enough number of potential entrants, N such 
that, for N > N, the equilibrium price of z in 
the closed economy is driven below the 
break-even level, p< w. N is given by 

(18) 

Furthermore, there exists an N. > N such that, 
for all N>N,, the period-1 price is driven 
down to its minimum value p = w(i— 7¢). 
At this price, the average loss in terms of 
opportunity cost sujfered by a new entrant to 
the z sector is given by 


(19) 


Furthermore, N, is given by 


(p-—w)/w=-o7 <0. 
(20) 


(See the Appendix for a proof.) 


''T will establish that if ex ante identical producers 
can sign an Arrow-Debreu insurance contract, that 
contract will specify a first-period entry price of p =e. 
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Even though Proposition 4 implies that 
the entry price e is driven below the break- 
even price w, it does not imply that the 
equilibrium price p is driven below the 
break-even price. For this to be the case, 
the output produced by all potential en- 
trants N must exceed demand at the 
break-even price. Combining this condition 
with Proposition 4, I obtain Proposition 5. 
The max operator’s two terms arise because 
there are two requirements for the equilib- 
rium price to be driven below the break-even 
price. A necessary condition is that e<w, 
the second term in (18). It is also nec- 
essary that supply be large enough rela- 
tive to demand so that p<w. This will 
be true when N>aMw/w, the first term 
in (18). 

It is easy to see that e < w does not imply 
p<w. For example, suppose that demand 
at the break-even price equals the output of 
all potential entrants aM = N sothat p=w. 
In period 2, the low-productivity firms, 7N 
in number, drop out, so that period-2 out- 
put falls to N-—mN(U-—g). The period-2 
price will rise; but if p’ <w/(1— ¢), which 
will be true if 


N>aM(1-¢)/[1-7(1-¢)] 


then e<w by Proposition 4. This happens 
for ¢>a7/(1+7). Thus, both terms are 
needed in (18) to insure that the equilib- 
rium price be driven below the break-even 
price. 

Since all firms share the identical entry 
decision rule, the aggregate supply curve is 
perfectly elastic at the ex ante entry price e 
until z = N, at which point it is completely 
inelastic. Recalling that e is the price at 
which firms are indifferent between enter- 
ing the z sector or the y sector in period 1, 
nothing guarantees that all firms that are 
indifferent between producing z or y will 
actually produce z. In particular if d.(e) < 
N, all N producers cannot produce at e. If 
producers are rational, they take into ac- 
count this possibility when forming their 
decision rule. However, if when d.(e)< N 
firms are chosen randomly from among the 
N potential entrants so that a fraction 
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d.(e)/N is selected, then the optimal entry 
decision rule is the same.'? 

Before showing how a free-trade equilib- 
rium with dumping can arise, I first briefly 
explore the relationship among the ex ante 
entry price, the number of firms that enter 
the market for z, and the number of firms 
that would have entered the market for z 
(and the price they would have received) 
had each firm known ex ante its particular 
productivity in producing z. I define a de- 
structive (deficient) competition equilibrium 
as an equilibrium in which, because firms 
are ex ante identical but ex post different, 
too many (too few) firms enter, driving the 
price below (keeping the price above) the 
level that would have prevailed had each 
firm known ex ante its particular productiv- 
ity in producing z. In a destructive competi- 
tion equilibrium, price is driven below aver- 
age (opportunity) cost in period 1. I also 
show that, for certain parameter values, not 
only does a deficient competition equilib- 
rium exist in this model, but perhaps sur- 
prisingly, price is driven below opportunity 
cost in a deficient competition equilibrium. 


'2 However, the interpretation is slightly different 
(cf. Avinash Dixit and Carl Shapiro, 1986): e becomes 
the price at which firms are indifferent between 
(i) entering a lottery with probability d,(e)/N of win- 
ning, in which the winners produce z and the losers 
produce y, and (ii) not entering the lottery and just 
producing y. The proof of this result is just algebraic 
manipulation of (8). However, one insight is required. 
The probability of winning the lottery in the second 
period, 6’, for firms that did not enter in the first 
period is endogenous and influences e according to 


e=B-( 1/E€') E max[ pé'- w,0] 
+ (1/Eé')6’ max{ p'Eé' — w,0]. 


Recall from Proposition 1 that firms will be indifferent 
between newly entering z in period 2 and staying in 
y when p’Eé=w. This can be shown to imply that, 
when p’Eé > w, 0’ must equal | and that, when 0’ <1, 
p'EEé—w=0. Thus, without loss of generality, equa- 
tion (8) defines the entry decision rule. This is not to 
say that expected utility is invariant to 6’ or @ given p 
and p’, but rather that utility is changed in such a way 
that entry decisions are unaffected. 
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PROPOSITION 6: A destructive competi- 
tion equilibrium exists if 


(21) aM/[1-7(1-¢)] 


Furthermore, p < w in a destructive competi- 
tion equilibrium. A deficient competition 
equilibrium exists if 


(22) aM/r<N. 


Furthermore, p<w in any such deficient 
competition equilibrium. 


How can aggregate losses occur when the 
price of z is higher ard fewer firms enter 
than under ex ante knowledge of firm- 
specific productivities? In this model, price 
is driven below opportunity cost in the first 
period because ex ante identical firms are 
willing to accept expected losses in the pres- 
ent in exchange for the future profits that 
will accrue to the most productive among 
them. The fact that the number of firms 
that actually enter in period 1 (recall that 
all N are ex ante indifferent, as discussed in 
footnote 12) is less than would be the case 
under ex ante knowledge of firm-specific 
productivities is irrelevant. 


IV. Dumping in a Free-Trade Equilibrium 


With the results obtained in Section III, it 
is now straightforward to provide sufficient 
conditions for dumping te occur in a free- 
trade equilibrium. Dumping will be said to 


‘PROOF: The number of producers who, under 
full information, would produce at the ex ante entry 
price e=w(l—7o¢)<w, is wN. Their total output 
would be wN(i+ ). The price they would receive is 
aMw When e=w(1—7 9), the second 
inequality in (21) implies that total output and entry 
are greater at e (and e itself is less than the price that 
would prevail under full information) than with ex ante 
knowledge about firm-specific productivities. Since 
for the set of N that 
satisfies (21) is nonempty. By Proposition 5, p< w. 
When e = w(1 — 79), (22) implies that total output and 
entry are less at e (and e itself is higher) than under 
ex ante knowledge of firm-specific productivities. By 
Proposition 5, p< w. 


occur when z is exported at a common 
world price that falls below the opportunity 
cost of production in the exporting country. 
For positive trade flows—and thus dumping 
—to occur, it must be the case that the two 
countries not be identical. As discussed ear- 
lier, the results in Section II characterize 
the autarky equilibrium in each country, 
which is the free-trade equilibrium when 
the two countries are identical. By Proposi- 
tions 4, 5, and 6, the price of z may be 
driven below opportunity cost in period 1. 
However, since the two countries are identi- 
cal, no trade flows between them will arise 
in equilibrium, and no dumping can be said 
to occur. 

I begin with the symmetric case in which 
countries | and 2 differ in only one respect: 
each potential z producer in country 1 
knows its firm-specific productivity in pro- 
ducing z, while no potential z entrant in 
country 2 knows its firm-specific productiv- 
ity. Otherwise, the distribution G(€) of pro- 
ductivities across firms, the number of pro- 
ducers of y and potential producers of z 
(M and N, respectively), and their prefer- 
ences are identical. Thus it can be said that 
countries 1 and 2 differ only in their initial 
stock of technological knowledge and not in 
their initial stock of technological capability. 
Interestingly, not only will such a difference 
in the initial stock of technological knowl- 
edge induce trade flows between the two 
otherwise identical countries, but under 
sufficient conditions to be established now, 
dumping will occur in the free-trade equi- 
librium. That is, in the first period country 2 
will export z to country 1 at a common 
world price that is driven below the average 
cost of production in country 2. In the sec- 
ond period, following the excessive entry 
and resulting destructive competition that 
transpires in period 1,'* a rationally ex- 
pected shakeout occurs: inefficient country-2 
firms, discovering their inability to cover 
opportunity cost, exit the z sector. This exit 
of country-2 producers drives up the period- 
2 price of z and acts to reduce or eliminate 
country 1’s trade deficit in z. 


‘41m what follows, I show formally that if dumping 
occurs in the symmetric case, the equilibrium must be a 
destructive-competition equilibrium. 
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PROPOSITION 7: Given 7, ¢, M, and a, 
there exists a symmetric free-trade dumping 
equilibrium if 


(23) N<max[2/{1+7(1+¢)}, 


(1-¢)/{1-7(1- ¢)}]am 


In this dumping equilibrium, country 1 im- 
ports z from country 2 in period 1 at a price p 
that falls below average cost w. 


(See the Appendix for a proof.) 

This is a striking result. It says that for 
any values of the distribution parameters 7 
and g, the expenditure share a>0O, the 
“wage” (opportunity cost) w, and the posi- 
tion of the world demand curve as indexed 
by M (the number of z-consumers in each 
country), there exists a number, N, of po- 
tential z-producers in each country such 
that country 1 is net importer of z at a price 
that is driven below the break-even point 
for country-2 exporters. This is despite the 
fact that the two countries differ only in 
their initial stock of technological knowl- 
edge, and not in their capability. 

Three conditions must be met if dumping, 
the export of goods at a price below average 
cost, is to occur. First, there must be some 
value to learning about productivity, and 
thus firms must be willing to suffer static 
losses so that e < w. This is just the condi- 
tion that p’<w/(i-—). In this symmetric 
world in which countries only differ in their 
stock of knowledge, not their stock of tech- 
nology, this condition is the same as dis- 
cussed under Proposition 4: (world) demand 
at the price p’=w/(1—@g), 2aM(U— 9), 
must fall short of the minimum (world) sup- 
ply at that price, 2N —27N(1— ¢). This is 
the second term in max{-]. It is graphed in 
Figure 1 and is denoted by the demand 
curve D3. All demand curves to the left of 
D3 insure p’ <w/(1— ¢) so that country-2 
firms are willing to enter the z sector in 
period | at a price e<w. The equilibrium 
period-1 price must in fact fall short of the 
break-even price w. What is the world sup- 
ply at p < w? It is the supply of the efficient 
country-1 firms, wN(1+ ¢), plus the supply 
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FiGurRE 1. WorLD SupPLY AND DEMAND 


Notes: 


Supply a = 7N(1+ ¢); supply b= N+ 71+ 
New entrants in country 2 will be willing to sell z 


y); supply c = 2N-—27N(1-— ¢). 
at a period-1 price e < w if world 


demand lies to the left of D3. The equilibrium period-1 price p will fall below w if 
world demand lies to the left of D2. Country 2 will export z to country 1 if world 
demand lies to the right of D1 but to the left of D2. Thus, dumping will occur if world 


demand falls between DI and D2. 


of all country-2 entrants, N. Thus, p < w if 
2aM <N+7N(1+ ¢). This is the firm term 
in max[:], which is graphed in Figure | and 
denoted by the demand curve D2. Thus, 
when the world demand curve lies to the 
left of D2 and D3, country-2 firms are not 
only willing to receive a period-1 price p< 
w, but do in fact receive it. As Figure | is 
drawn, 7 is sufficiently small so that N+ 
wN(i+ @), the period-1 world supply at p 
< w, is less than 2N —27N(i — o), the min- 
imum period-2 supply at a price w <p’ < 

Finally, there is a third requirement for 
dumping that is not present in the closed 
economy: the world demand at p must be 
more than twice as great as the supply of 
the efficient country-1 firms: 2aMw /p> 
27N(1 + ¢). This condition is satisfied if the 
second inequality in (23) is met; it is graphed 
in Figure 1 as the demand curve D1. All 
demand curves to the right of D1 satisfy the 
requirement that the dumping country ex- 
port the dumped good. Otherwise, even 
though firms in the “laggard” country sell 
below average cost, country 2 imports z 


from the country with the head start. That 
is, the low-cost producers in period 1 are 
the aN efficient firms in country 1 that are 
willing to produce at price w /(1+ g)<e. In 
this symmetric world, N cannot be too large, 
given demand, because large enough N im- 
plies that country 1 exports to country 2. 
However, this should not obscure the fact 
that, if one holds constant the number of 
country-! firms and world demand, there 
exists a sufficiently large number of poten- 
tial country-2 entrants such that dumping 
must occur. 


COROLLARY 3: The symmetric free-trade 
dumping equilibrium is a destructive-competi- 
tion equilibrium.'> If world demand is too 
low relative to potential supply, dumping can- 


°PROOF: If country 1 is to be a net importer of z, 
it must be the case that per-country demand at p 
exceeds country-! supply, 7(1+ Since country-1 
supply is the supply of z in country 2 that would be 
forthcoming if country-2 producers had knowledge of 
firm-specific productivities, a Mw / p — 77(1+ )N, “too 
many” country-2 firms enter the z sector in period 1. 
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not occur. Specifically, given N, M, 7, and ¢, 
there exists a value of a, a, such that if 
0 <a<a, country I exports z to country 2 in 
period I, and dumping does not occur; @ is 
given by 


(24) a=(1-7¢)7(1+¢9)N/M <1. 


Alternatively, given N, 7, 9, and a, there 
exists a value of M, M, such that if0 <M < 
M, country 1 exports z to country 2 in 
period 1, and dumping does not occur; M is 
given by the following:'® 

(25) 
COROLLARY 4: The exit of inefficient 
country-2 producers drives up the period-2 
price of z to p'>w and acts to shrink the 
country-I trade deficit in z. Furthermore, if 
(26) N>aM/[1-7(1- ¢)] 

such a shakeout in country 2 completely elim- 


inates the country-I1 trade deficit in z in pe- 
riod 2.'" 


While dumping of the sort investigated 
here cannot occur if demand is too low, it 
does not follow that, in the context of my 
model or a more general but similarly for- 
mulated model, that dumping must occur if 
demand is high. Given symmetry and N, 
there is a range in which demand may re- 
side such that dumping occurs. If demand is 
too low, dumping cannot occur because de- 


see this, note that, since p>w(l— To), 
d(p)<2aM/(—7g). Thus, if it is the case that 
2aM/U-—7)<27(i+@)N, country 1 is a net ex- 
porter of z in period 1. Equations (24) and (25) follow 
immediately. 

'7 This is not to say that country 2 cannot run a trade 
surplus in z in period 2. However, the conditions for 
this to occur will not be presented, although they are 
straightforward, if tedious, to derive. To see (26), note 
that if (23) is satisfied, w < p'’<w/(1— ¢) so that aM 
is the maximum period-2 demand for z in country 1. 
N[1—7(i—)] is the country-1 supply of z at any 
w <p’ <w/(1-—). Thus, if (26) is satisfied, domestic 
production in country 1 can meet demand. Note finally 
that ), so that the 
set of N that satisfies (26) and (23) (and thus Proposi- 
tion 7 and Corollary 4) is nonempty. 
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mand can be met by “experienced” firms at 
a price that falls below the required entry 
price. This is Corollary 4. If demand is 
sufficiently high, given N, dumping can fail 
to occur for two reasons. First, p’ can be 
driven above w /(1— ¢) so that there is no 
value to learning productivity this period: 
this is the second term in max{[:]. Second, 
even if p'’<w/(1—@) so that e<w, de- 
mand can be sufficiently strong so that the 
equilibrium p> w. This is the first term in 
max{ ]. 

I now examine the welfare implications of 
dumping in a free-trade equilibrium. For 
concreteness, I focus on the case in which 


(27) max[2/{1+7(1+@)}, 
1/{1-(1-¢)}]am 
<N<aM/7r(1+¢). 
As can be seen by comparing (23) and (27), 
any N that satisfies (27) will satisfy Proposi- 
tion 7. Furthermore, such an N will insure 
that p’=w in autarky in both countries, 
that p =w in autarky in country 1, and that 
p =e =w(1—7¢) in country 2.'* Under au- 
tarky, welfare (as measured by the sum of 


consumer and producer surplus in the mar- 
ket for z) is given by 


(28) [(aMw /P)aP 


in country 1, while welfare in country 2 is 
just 


(29) (aMw/P)aP + [ (aMw /P)aP. 


Note that the most efficient producers in 


'SPROOF: The country-2 results follow immediate- 
ly from Proposition 5: {1/[1-7U-p)}}JaM<N< 
aM/7(1+ ) is just the condition that per-country 
demand exceeds the supply of the most efficient pro- 
ducers but falls short of the total supply of all but the 
least efficient producers. 
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country 1 earn positive rents in autarky, 
while the net producer surplus in country 2 
is zero in autarky. This is a direct implica- 
tion of the fact that the period-1 autarky 
price is driven down to the ex ante entry 
price in country 2, and that the equilibrium 
period-2 price of z is pushed up to the 
ex ante break-even level w. 

Under the conditions of Proposition 7, a 
free-trade dumping equilibrium exists in 
which e < p<w and, by (27), p’=w. The 
change in welfare in country 1 is given by 


(30) [(aMw/P) dP 
p 


—(w- p)7(1+@)N>0. 


Despite the fact that dumping occurs in the 
free-trade equilibrium, country 1 unambigu- 
ously benefits from free trade. The loss in 
producer surplus resulting from the fall in 
the price of z represents a transfer from 
the most efficient producers to consumers. 
Since country-1 demand for z exceeds the 
output of the most efficient z-producers, 
the gain in consumer surplus must exceed 
the loss in producer surplus. Note that, rela- 
tive to autarky, employment in the z sector 
is lower by aM— (1+ @)N, as less effi- 
cient z-producers, unable to cover their op- 
portunity cost, shift to the y sector. Thus, 
both employment and profits in the country- 
1 z sector shrink in period 1 in a free-trade 
dumping equilibrium. 

The change in welfare in country 2 is 
given by 


(31) [(aMw/P) dP +(p—e)N>0. 
Pp 


Welfare rises in country 2 if world demand 
is sufficiently strong to pull up p > e. Other- 
wise, country-2 welfare is unchanged in the 
free-trade dumping equilibrium. How can 
this be? The supply of z in country 2 is 
completely elastic at the ex ante entry price 
e. Country-2 producer’s ex ante expected 
and ex post average rents are driven to zero 
at e. Although employment and production 
in the z sector increase in a free-trade 
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dumping equilibrium (relative to the au- 
tarky equilibrium), only a fraction 7 of these 
exporters will obtain positive producer sur- 
plus ex post; the remaining fraction 1— 7 
will suffer losses that exactly offset these 
gains. Obviously, country-2 consumers lose 
if world demand pushes p above e; but this 
loss must fall short of the gain to producers, 
because country-2 supply at p>e is equal 
to N, while country-2 demand at e (and 
thus at p) is less than N. Thus, I have now 
established the following proposition. 


PROPOSITION 8: Consider the free-trade 
dumping equilibrium that satisfies (27). Then, 
relative to autarky, the following must be 
true: 


(a) The welfare of country-1 consumers im- 
proves; the welfare, rents, and period-1 
output of country-1 z-producers contract; 
and the gain in country-1 consumer sur- 
plus exceeds the loss in producer surplus. 
The welfare of country-2 consumers does 
not improve; the welfare, rents, and 
period-1 output of country-2 z-producers 
do not contract; and the gain, if any, in 
country-2 producer surplus exceeds the 
loss, if any, of country-2 consumer sur- 
plus. 


Note that while less efficient z-producers 
in country 1 are unable to cover their op- 
portunity cost in period 1 of the dumping 
equilibrium, they only earn w in autarky. 
Thus the extent of injury in the z sector in 
country | is the loss of pure rents that 
accrue to the most efficient producers. It is 
easy to generalize this result. If N satisfies 
Proposition 7 such that the autarky price in 
country 1, pa, satisfies p,=p'>w and 
p'>w> in country-2 autarky, then 
welfare in country 1 still improves in the 
free-trade dumping equilibrium: 


(aMw /P) dP -(p'— p)r(1+ 


—(p’-—w)(1-27)N>0 


which can be shown to be positive if (23) is 


satisfied. The change in country-2 welfare is 
given by 


[(aMw /P) dP py)N>0 


which is also positive. In this equilibrium 
the z-producers who cannot cover opportu- 
nity cost in the dumping equilibrium actu- 
ally achieve lower utility than in autarky. 

I will now discuss the robustness of these 
results. Consider the willingness of potential 
entrants to suffer expected static losses to- 
day in exchange for higher returns in the 
future. In my construct, information about 
productivity has value if and only if this 
information means that “established” firms 
make different production decisions in the 
future than do new entrants. Of course, the 
return to an established firm cannot be lower 
than the return to a new entrant, and it will 
in general be higher. Furthermore, there 
must be greater supply of z in the first 
period than in the future because some 
inefficient entrants exit; thus, given demand, 
prices must rise between period 1 and 2. 
Incorporating the value of information, new 
entrants in period 1 are willing to take static 
losses to obtain the experience that comes 
from producing. This insight seems quite 
general. However, even in my simple model, 
it is not always true that established firms 
earn higher expected returns than do en- 
trants. If future price is sufficiently high, 
p'’=>w/(1— @), experienced and inexperi- 
enced firms choose the same sector, z, in 
the future. Since ex ante in period 0 experi- 
enced and inexperienced firms are the same, 
their expected period-2 returns are the 
same: there is no value to early entry. Thus, 
as I pointed out above, if demand is too 
strong dumping will not occur. However, 
there is a range in which the demand curve 
may reside such that demand is sufficiently 
high to induce entry and dumping, but not 
high enough to insure positive expected 
rents to all early entrants. These are exactly 
the parameter restrictions discussed at 
length above. 

I conclude this section with an investiga- 
tion of the welfare properties of the autarky 
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equilibrium in a country in which each po- 
tential entrant is initially uncertain about its 
productivity in the z sector. At the begin- 
ning of period 1, all potential z-producers 
are ex ante identical. Suppose they sign an 
Arrow-Debreu insurance contract that 
guarantees that each z-producer will share 
equally in the total output of all z-pro- 
ducers in both periods. Let p* denote the 
first-period price at which such a contract 
would require entry to the z sector. For any 
given vector of current and future prices 
(p,p’) such that w<p’<w/(1—@) and 
p> p*, the utility achieved by each poten- 
tial z-entrant is given by 


(32) 

+ 0(p)p'[7(1+ ¢)+(1-27)] 

+ 
where 0(p)= min[d.(p)/N, 1]. Note that a 
potential z-entrant will only agree to such 
an Arrow-Debreu contract if u(p, p’)>w + 
p’, the expected value of producing y this 
period, entering the z sector next period, 
and keeping the returns which, in expecta- 
tion, are equal to p’. It follows that the 
insurance contract will only require entry!” 
to the z sector in period 1 if p> p* such 
that 
(33) 0(p)p+4(p)p' [r+ ¢)+CU-27)] 


+ O( 
>O(p)w+O(p)p’ 
which is just the condition that p* satisfies 
(34) p*=w—-7(1-¢) 
x max[w /(1— @) — p’,0]. 
This is exactly the entry decision rule 


derived in Section III, equation (14). It fol- 
lows that, for any given p’, the amount of 


'YAs discussed in Section III, when d{p)<N, it 
is assumed that all potential z entrants in fact en- 
ter a lottery in which winners produce z and losers 
produce y. 
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entry and, in a two-country world, dumping 
that occurs is exactly the same irrespective 
of the presence or absence of an Arrow- 
Debreu insurance market. The Arrow- 
Debreu contract will stipulate that all 
signees enter the z market when p > p* =e. 
When p =e, each z-producer receives 


in period 1 (and of course each of the 
[1— @(e)]N signees who are y-producers also 
receive 0(e)e +[1 — 0(e)|w). With insurance, 
the loss to each z-producer is lower in the 
first period than is the expected loss to each 
z-producer in the absence of insurance. In 
period 2, each insured z-producer receives 


(35) 
+ 0( p)wr+{l = p)| p’ 


which is less than the expected return, in 
the absence of insurance, of 
received by each experi- 
enced z-producer in period 2. The key point 
is that, with risk neutrality, the facility to 
pool risk afforded by an insurance contract 
does not alter conditions under which it is 
optimal to enter the z sector in the first 
period. Such a contract insures that each 
firm earns the same utility u(p,p’). In the 
absence of such a contract, the expected 
utility of each potential z entrant is u(p, p’). 

Having demonstrated in Proposition 8 
that, relative to autarky, welfare in the 
free-trade dumping equilibrium must im- 
prove, I now show that the autarky competi- 
tive equilibrium in fact maximizes the sum 
of consumer and producer surplus. Again, 
for concreteness I focus on the case in which 
rents are driven to zero in autarky equilib- 
rium, which by Proposition 5 will be true if 
Since p’=w in 
such an equilibrium, producer surplus at 
any first-period price p is given by 


(36) ps(p)=(p—w)d(p) 
+[w(1+¢)—w]md(p). 


Let p<p. Then the change in producer 
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surplus may be expressed as 
(37) ps( p)—ps(p) 
=(p—p)d(p) 
+[p—wU— d(p)}. 
The change in consumer surplus is just 
(38) cs( p)-—cs( p) = /P) dP 


>(p— p)d(p). 


Thus, for all p< p, social surplus at p ex- 
ceeds social surplus at p if 


(39) p>e=w(l- 
Now let p=e. If p <e, (37) implies that 
(37') ps( p)—ps(e) =(p—e)d(p) 


which means that producer surplus unam- 
biguously contracts. Furthermore, since 


(38’) cs( p)—cs(e) 


= [ /P) dP <(e-—p)d(p) 
Pp 


social surplus at p<e falls short of social 
surplus at e. Thus, I have just proved the 
following proposition: 


PROPOSITION 9: Consider autarky 
equilibrium in which the expected rent to a 
potential entrant is driven to zero. This equi- 
librium, which will occur whenever N > 
aM/{1—7U-—)], maximizes the sum of 
consumer and producer surplus subject to the 
constraint that all potential entrants to the z 
sector are ex ante identical. 


V. Implications 


In this section, I explore some key impli- 
cations of the model of entry, dumping, and 
shakeout. In pursuit of this goal, it will be 
useful to generalize slightly the model of 
Section IV to allow for differences between 
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countries in w, productivity of labor in the y 
sector, and thus the opportunity cost of 
resources in the z sector. In particular, let 
@w<w denote the opportunity cost of re- 
sources in country 2 and suppose that 


<w<w. 


I continue to assume that distribution of 
productivities across firms is identical in the 
two countries but that no potential entrant 
in country 2 knows its firm-specific produc- 
tivity in the z sector until it has actually 
produced z. I begin by stating sufficient 
conditions for a free-trade dumping equilib- 
rium to exist in such a world economy. 


PROPOSITION 10: Given 7, 9, w, w, M, 
and a, there exists a free-trade dumping equi- 
librium if (39) is satisfied and if: 


(40) ¢)]} 
XaM(w+w)/o 


In this equilibrium, p < w, p' = w, country 1 
imports z from country 2 in both periods, and 
dumping occurs in the first period.” 


Because w < w and the distribution G(¢) 
of productivities across firms in the z sector 
is the same in both countries, country 2 has 
a comparative advantage in the z sector, 
and it will export z to country 1 in both 


“PROOF: aM(w+w)/w is the world demand 
for z at the country-2 break-even price w. The mini- 
mum world supply at p’=@ is just @)N+ 
(1—7(—@))N since, by (39), w<w. At any p’<w, 
27(1+ @)N is the maximum world supply of z. Equa- 
tion (40) follows immediately as the condition for p’ = 
w. For dumping to occur, p < w, which will be true if 
aM(w+w)/w<N+7(1+ 9), as implied by the first 
inequality in (40). It must also be true that aMw/w 
exceeds 7(1+ )N, as is implied by the second in- 
equality in (40). 
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periods. However, notwithstanding its com- 
parative advantage in the z sector, country 
2 can still dump in period 1 for the reason 
outlined in Sections II and III: firms are 
willing to enter the z sector at a price that 
fails on average to cover the opportunity 
costs of resources in period 1 if the most 
efficient among them earn rents in period 2. 

Even with a comparative advantage in z, 
country 2 (the “low-wage” country) dumps 
in period 1 if the conditions of Proposition 
10 are met. Again, three conditions must be 
met if dumping is to occur: there must be 
value to period-1 entry, p’ < w /(1— ); de- 
mand must exceed twice the supply of the 
efficient country-1 producers (otherwise 
these producers export to country 2); and 
demand at the break-even price w must not 
exceed potential world supply at that price. 
These are the parameter restrictions that 
are defined by Proposition 10. In fact, for 
concreteness, Proposition 10 sets out re- 
strictions such that p’ = w. 

While country 1 continues to import z in 
period 2, there is shakeout of inefficient 
country-2 producers and a contraction in 
country-2 output in period 2 that pushes up 
the world price of z to p’ = w.”' I now show 
that, under a mild additional restriction on 
the parameter ¢ (the fraction by which the 
outputs of the most and least efficient firms 
differ from the output of the average firm), 
it is possible to obtain simple and intuitive 
expressions for the rate at which inefficient 
firms exit during the shakeout as well as the 
average loss incurred by new entrants in 
period 1. 


PROPOSITION 11: If ¢ <4, then in any 
free-trade dumping equilibrium that satisfies 
Proposition 10, the equilibrium price of z in 
period 1 is driven down to p = w(1— 1), the 
average loss incurred per new entrant in pe- 
riod I is equal to —7gw, and the rate at 


However, because 
w, the rise in the world price of z does not call forth an 
expansion in the country-1 output of z, which in both 
periods is given by 7(1+ @)N. 


3 
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which inefficient country-2 firms exit during 
. 
the shakeout is constant and equal to 7.~~ 


Proposition 11 highlights an interesting 
feature of the particular dumping equilib- 
rium studied in this section; namely, given 
1—27, the probability of being a firm of 
average productivity, the exit rate is inde- 
pendent of the average loss per entrant. 
The larger is ¢, the fraction by which the 
output of the most (and least) efficient firms 
differs from the output of the average firm, 
the smaller will be p, and thus the greater 
will be the average loss per entrant. Since 
p’ = a, all low-productivity entrants will exit 
as part of the shakeout. These low-produc- 
tivity firms represent a fraction 7 of all 
period-1 entrants. The condition that g < 4 
is just the condition that the output pro- 
duced by each exiting firm, 1— , is suffi- 
ciently great so that only low-productivity 
firms exit as part of the shakeout, and 
ex post all entrants of average and, of course, 
high productivity produce in both periods. 

Thus, under the conditions of Proposition 
11, some part of the world demand for z in 


period 2 will be met by new entry in country 
2. This means that the change in country-2 
output is the sum of two forces working in 
opposite directions: the shakeout of ineffi- 
cient period-1 entrants and the entry of new 
z-producers in period 2. Of course, under 


PROOF: Letting D(p)=aM(w+w)/p, then 
— 7e))— 7+ is the output 
produced by exiting firms at the period-1 price p= 
w(1—7g). The contraction in country-2 output be- 
tween periods 1 and 2 is just D(w(1—7o))— Dlw). 
The output produced by exiting firms will exceed the 
contraction in country-2 output if 


From (40), this will be true if (1— 7e)/U—2¢) 
< 2. Rearranging, this is just the condition that 39 < | 
+ mo — 1’, which for any 7 > 0, will be true if g <4. 
If 
From (40), it is known that 


D(w)/[1-7U-¢)+ ¢)]<N. 


It is easy to verify that, if p <4, 1—7ol+7(1+ )) 
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the conditions of Proposition 11, the reduc- 
tion in output resulting from the shakeout, 
in absolute value, must exceed contribution 
to output arising from the new entry. How- 
ever, it follows that the contraction in out- 
put understates the amount of shakeout and 
dumping for two reasons. First, because of 
the entry of new z-producers in period 2, 
the contraction in output must be less than 
the output produced by inefficient firms that 
exit during the shakeout. Second, the num- 
ber of firms that exit as part of the shakeout 
understates the number of firms that dump 
in period 1. This follows from the fact that 
p is driven down to w(1— 7@), so that firms 
of average productivity, (1—27)N in num- 
ber, also dump in period 1. Yet none of 
these firms exit in period 2. In other words, 
the fact that a firm does not exit as part of 
the shakeout does not imply that such a 
firm did not dump in the first period. 

Proposition 11 does not rule out a posi- 
tive correlation between the exit rate and 
the average loss per entrant. Indeed, given 
y, both the exit rate and the average loss 
per entrant are increasing in zr. Thus, the 
effect of a mean-preserving spread of G(¢) 
is to raise both the exit rate and the average 
loss per entrant. Note finally that if 7 and 
© are negatively correlated, so that sectors 
with few firms of average productivity are 
also sectors in which the outputs of the 
most and least efficient firms differ little 
from the output of the average firm, then 
the exit rate and the average loss per en- 
trant can in fact be negatively correlated. In 
sum, without rather detailed knowledge of 
the distribution of productivities among po- 
tential entrants, a correlation of either sign, 
or of zero, between exit rates and average 
losses is consistent with some version of the 
model. 

For a given world demand curve, I have 
shown that the contraction in output in the 
dumping country that follows the initial 
stage of entry and shakeout understates the 
amount of shakeout and dumping that actu- 
ally occurs. I now generalize the model to 
allow for growth in world demand and show 
that it is possible for the initial stage of 
dumping and shakeout to be followed by an 


expansion in output in the dumping coun- 
try. To see this, let aM'(w+w)/p’ denote 
the world demand for z in period 2, and 
suppose for concreteness that M’/M= 
(1+ g)/U—7@). With this rate of world 
demand growth, the demand for z at 
the period-2 price of p’=w exceeds the 
demand for z at the period-1 price p= 
w(1— 7g). If N is such that 


(41) 
XaM'(wt+w)/o 
<N<aM(w+w)/w27(1+¢) 


is satisfied, then dumping occurs in period 
1, and inefficient firms exit as part of the 
shakeout; but output expands after the 
shakeout in response to the fully anticipated 
rise in world demand.” 

In addition to the above-mentioned im- 
plications, there are at least three addi- 
tional insights conveyed by this approach 
that may be directly compared with the 
record so as to assess the relevance of the 
theory. First, following the lead of Michael 
Gort and Steven Klepper (1982), the model 
implies that dumping should be observed in 
younger industries in which new technology 
—new at least to the dumping country— 
plays an essential role. Second, the model 
implies that, even in countries in which an 
existing production base is in place, a surge 
in demand, to the extent that it attracts new 
entrants or new additions to capacity, may 
be associated with dumping. Third, the 
model implies that dumping should be ob- 
served in countries whose firms have not yet 
mastered an existing technology or acquired 
sufficient experience in producing the 
dumped good. I consider in turn the empiri- 
cal support for each of these predictions. 


*3Note that the set of N that satisfies (41) is 
nonempty if 
(1-7¢)/27(1+ ¢) 
> {1/[1- ¢)+ ¢)]}. 


For any given 7, there exists sufficiently small but 
positive y and g such that (41) is satisfied. 
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Gort and Klepper (1982) develop an evo- 
lutionary theory of industry structure fol- 
lowing the introduction of a product innova- 
tion.24* According to their theory and the 
case studies of American industries they 
report, the markets “for most new products 
appear to pass through at least five distin- 
guishable stages in the course of their evo- 
lution. New industries generally pass 
through a stage (IV) in which the number of 
producers declines significantly” (Gort and 
Klepper, 1982 p. 651). In Gort and Klepper’s 
descriptive model, a new product is com- 
mercially introduced in stage I, followed by 
a stage II in which the entry rate and indus- 
try output accelerate rapidly. In stage III, 
the rate of net entry declines dramatically, 
ultimately to zero, as a result of “the ac- 
cumulation of experience by existing 
firms...and a gradual reduction in the pop- 
ulation of potential entrants that have not 
as yet entered the market” (Gort and Klep- 
per, 1982 p. 633). In the aforementioned 
stage IV, the “less efficient firms who are 
unable to imitate the leaders...are forced 
out of the market... . Consequently, the exit 
rate rises sharply” (Gort and Klepper, 1982 
p. 633). Finally, in stage V, the net entry 
rate returns to zero, where it remains until 
the eventual shrinkage of the market is 
brought on, either by product obsolescence 
or the introduction of a competing product. 

Following the lead of Gort and Klepper 
(1982) and their case-study approach, | 
searched the record of dumping cases filed 
during the last decade to isolate those in 
which new technology and a substantial shift 
in world demand might be expected to play 
a prominent role. I confined my attention to 
cases in which the Commerce Department 


*47 thank a referee for bringing the Gort and Klep- 
per (1982) paper to my attention. The literature on 
competitive, as opposed to strategic, models of selec- 
tion in industries dates back at least to Armen Alchian 
(1950) and Richard R. Nelson and Sidney G. Winter 
(1982). Following on the seminal work of Jovanovic 
(1982) and Steven Lippman and R. Rumelt (1982), 
recent papers by Hugo Hopenhayn (1986), Ariel Pakes 
and Richard Ericson (1988), and Jovanovic and Saul 
Lach (1989) have begun to explore the implications of 
learning, entry, and exit on industry structure in a 
closed-economy context. 
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ruled that sales “at less than fair value” 
were in fact made. Within this subset of 
cases, I examined only those in which the 
Commerce Department employed the “‘con- 
structed value” approach to determine fair 
value. Constructed value is the Depart- 
ment’s estimate of the average cost of pro- 
duction. It is employed whenever there are 
insufficient sales (and many times no sales) 
in the domestic market that can be used to 
calculate the domestic sales price. Thus, a 
finding of dumping (sales at less than fair 
value) based upon constructed value is a 
finding that the dumped goods have been 
exported at a price falling short of the Com- 
merce Department’s estimate of average 
cost. 

Among the cases I isolated in which tech- 
nology and robust world demand played a 
prominant role, two stand out: the dumping 
of cellular telephones and cellular-tele- 
phone subassemblies by Japan in 1984; and 
the dumping of small-business telephone 
systems and small-business telephone sub- 
assemblies by South Korea in 1988. In 1985, 
in response to a complaint filed by the Mo- 
torola Corporation in November 1984, the 
Commerce Department ruled (Federal Reg- 
ister, Vol. 50 [1985] p. 24554) that Japanese 
manufacturers were dumping cellular tele- 
phones and cellular-telephone subassem- 
blies in the United States. In its ruling, the 
Department noted that, because of insuffi- 
cient sales in the domestic market by at 
least three firms, the finding of dumping 
was based on evidence that cellular tele- 
phones were being exported at a price that 
failed to cover average cost. In 1989, in 
response to a complaint filed by the AT&T 
and Comdial Corporations, the Commerce 
Department ruled (Federal Register, Vol. 54 
[1989] p. 31980) that South Korean manu- 
facturers were dumping small-business tele- 
phone systems and subassemblies in the 
United States. Again, the finding of dump- 
ing was based on evidence that these busi- 
ness telephone systems were being exported 
to the United States at a price that failed to 
cover average cost. 

Steven Olley and Pakes (1988), in their 
recent study of the U.S. telecommunications 
industry, have documented that the demand 
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for communication equipment, and espe- 
cially imported equipment, surged after the 
breakup of AT&T in the early 1980’s. In 
1982, imports represented 4 percent of new 
supply in the U.S. market; by 1987 the share 
of imports had more than tripled to 14 
percent. Olley and Pakes (1988) also docu- 
ment that this surge in demand for imports 
accompanied a substantial jump in the entry 
(and exit) rate of new plants, owned by 
domestic as well foreign producers, in the 
U.S. market. They argue that, “entry into 
the telecommunications equipment market 
was effectively prohibited...prior to the 
AT&T divestiture [since] Western Electric 
supplied 90 percent of AT&T equipment 
purchases” (Olley and Pakes, 1988 p. 4). 
Furthermore, it was not until 1978 that the 
Federal Communications Commission even 
permitted the connection of such private 
subscriber equipment as telephones and 
PBX systems to the AT&T network. While 
I was not able to obtain detailed data on 
entry and exit rates at the plant level for the 
Japanese cellular-telephone industry or the 
South Korean business-telephone-system 
sector, it is noteworthy that the constructed- 
value definition of dumping was employed 
by the Commerce Department precisely be- 
cause of insufficient home-market sales of 
these products; price discrimination was 
neither alleged nor documented. Further- 
more, because of the surge in demand for 
telecommunications equipment following 
the AT&T divestiture, these cases would 
not appear to represent episodes of cyclical 
dumping. 

A second implication of the theory is 
that, even in countries in which an existing 
production base is in place, a surge in de- 
mand, to the extent that it attracts new 
entrants or new additions to capacity, may 
induce dumping. To understand this in the 
context of the model, suppose that, at the 
beginning of period 1, a fraction 6 of the N 
potential country-2 z-producers know their 
productivity in the z sector. It is straightfor- 
ward to show that if 


(42) 


Proposition 10 implies that country 2 will 


dump z in country 1, despite the fact that 
country 2 has in place an existing base of 
producers who share with producers in 
country 1 a common stock of knowledge 
about the technology for making z.”” Among 
the cases I investigated in which dumping 
was associated with a surge in demand for 
the product and a concomitant substantial 
expansion in output, I will mention two: the 
dumping of steel by Brazil in 1983 and 1984 
and the dumping of color television sets by 
Taiwan in 1983. 

In October 1982, just before the end of 
the 1981-1982 recession, the United States 
placed quota limits on the steel exports of 
the European Economic Community and 
reached an informal agreement with the 
Japanese to limit their U.S. exports (1. M. 
Destler, 1986 p. 130). In 1983, as the U.S. 
economy began to recover and ‘“‘in the af- 
termath of the quota limits placed on EEC 
steel in October 1982 [and the restraint 
demonstrated by Japanese producers]... 
a large upsurge of developing country 
steel... flowed into the United States” (John 
Conybeare, 1987 p. 212). A rash of dumping 
complaints, especially against Brazilian steel 
exporters, followed. According to the data 
compiled by Destler (1986), no fewer than 
eight findings of dumping steel at less than 
fair value were issued by the Commerce 
Department against Brazil in 1983 and 1984. 
To name just two, the department, in re- 
sponse to complaints filed by the Bethlehem 
Steel and United States Steel corporations, 
ruled that at least four Brazilian producers 
were dumping hot rolled steel plate and hot 
rolled steel sheet in the United States 
(Federal Register, Vol. 48 [1983] p. 40419). 
Because of insufficient domestic sales, the 
Commerce Department ruled that the US. 
price did not cover the constructed value of 
the Brazilian steel. 

The surge in steel exports from Brazil did 
not reflect a diversion of a relatively fixed 
supply to the U.S. market. According to 
U.S. Iron and Steel Institute data, exports 


*°To see this, note that p<w if aM(wt+w)/wo< 
m(1+ + +(1—5)N. When (42) is sat- 
isfied, [711+ 9)+ 701+ +(1—- > ¢)4+1- 
a(1— ¢), so that (40) is sufficient for dumping to occur. 
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of Brazilian steel to the United States were 
432,000 net tons in 1979, 604,000 tons in 
1982, 1,257,000 tons in 1983, 1,460,000 tons 
in 1984, 1,701,000 tons in 1985, and after a 
quota agreement between Brazil and the 
United States entered into effect, only 
1,103,000 tons in 1986. Total production of 
steel by Brazil was 15,300,000 net tons in 
1979, 14,324,000 tons in 1982, 16,159,000 
tons in 1983, 20,266,000 tons in 1984, 
22,546,000 tons in 1985, and 23,405,000 tons 
in 1986. Thus, between 1982 and 1985, the 
near trebling of Brazil steel exports to the 
United States accounted for less than 12 
percent of the increase in total steel output, 
which rose by nearly 60 percent.”° Further- 
more, this was not a period during which 
the total U.S. demand for steel was falling. 
In fact total U.S. steel demand (imports 
plus domestic production less exports) was 
roughly 80,000,000 tons in 1982, 85,000,000 
tons in 1983, 100,000,000 tons in 1984, and 
96,000,000 tons in 1985.77 

Consider next the dumping of color tele- 
vision sets by Taiwan. In 1983, in response 
to a complaint filed by six U.S. firms, the 
Commerce Department ruled that Taiwan 
producers were dumping color television 
sets in the United States. In its ruling 
(Federal Register, Vol. 48 [1983] p. 48491), 
the Commerce Department noted that, be- 
cause at least two firms, RCA and Hitachi, 
had no sales of color television sets in Tai- 
wan or in third markets, the finding of 
dumping was based upon sales in the United 
States at a price that fell below constructed 
value; price discrimination was of course 
neither alleged nor established for these 
firms. Total U.S. demand for television sets 
rose by more than 20 percent between 1982 


61 was unable to obtain plant-level data on entry 
and exit in the Brazilian steel industry. However, ac- 
cording to the industry publication American Metal 
Markets, Brazil continued to add to its steel capacity 
during the mid-1980’s while world steel capacity 
was shrinking (see “A Continued Shakeout Among 
Steelmakers Is Probable Through the Early 1990’s” 
[ American Metal Markets, 1 November 1985, p. 1)). 

“Of course, the U.S. steel industry contracted sub- 
stantially between 1979 and 1982. In 1979, total U.S. 
demand for steel-mill products was approximately 
115,000,000 tons, and total imports of steel-mill prod- 
ucts were 17,500,000 tons. 
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and 1983 to more than 21,000,000 sets, while 
total U.S. production rose by more than 10 
percent, to more than 11,000,000 sets.** To- 
tal exports of color television sets from Tai- 
wan rose from 600,000 sets in 1982 to 
1,500,000 sets in 1983. This did not repre- 
sent a diversion of a given output level. 
According to industry trade publications, 
total output of color television sets in Tai- 
wan was less than 1,300,000 units in 1980; 
by 1984 total exports alone were approach- 
ing 2,000,000 sets.7° 

A third implication of the model is that 
dumping should be observed in countries 
whose firms have not yet mastered an exist- 
ing technology or have not yet acquired 
sufficient experience in producing the 
dumped good. An examination of the record 
reveals several instances in which countries 
have been found dumping goods for which 
there is no home market in the United 
States during a cyclical expansion. Such 
goods would not appear to be capital- or 
technology-intensive. Two will be men- 
tioned here: the dumping of photograph 
albums by Hong Kong in 1984 and the 
dumping of thermostatically controlled ap- 
pliance plugs by Malaysia in 1987. 

In 1985, in response to a complaint filed 
by Esselte Pendaflex, Inc., in February 1985, 
the Commerce Department ruled (Federal 
Register, Vol. 50 [1985] p. 28830) that ex- 
ports of photograph albums from Hong 
Kong were being dumped in the United 
States. The department calculated the con- 
structed value of the albums because there 
was no home market for the goods. In 1988, 
in response to a complaint filed by the 
Triplex Inter Control Company in May 1988, 
the Commerce Department ruled (Federal 
Register, Vol. 53 [1988] p. 37827) that ex- 
ports of thermostatically controlled appli- 


*8although the Census Bureau reports import vol- 
umes separately for color and black-and-white sets, it 
only reports domestic production by five-digit SIC code 
36512, “television receivers.” 

“For production and export data on Taiwan color 
television sets, see The Japan Economic Journal (21 
June 1981, p. 9), The Journal of the Asian Economics 
Union (June 1984, p. 16), and Les Echos (28 November 
1983, p. 9). 
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ance plugs from Malaysia were being 
dumped in the United States. Again, the 
Commerce Department based its dumping 
findings on the constructed-value calcula- 
tion because of insufficient or nonexistent 
domestic market sales; price discrimination 
was neither alleged nor documented. A 
check of seven-digit SIC data revealed that 
the U.S. production of photograph albums 
(SIC 27824 41) actually rose substantially 
between 1983 and 1984 and was essentially 
unchanged in 1985, while the U.S. produc- 
tion of “temperature responsive controls for 
major appliances” (SIC 38220 75) was es- 
sentially flat in 1985, 1986, and 1987. 

It should be noted that, without rather 
detailed data on entry and exit at the plant 
level in the dumping country, it is not for- 
mally possible to confirm the theory of en- 
try, dumping, and shakeout presented in 
this paper. Only very recently have re- 
searchers begun to characterize actual pat- 
terns of firm entry and exit in the U.S. 
economy. As pointed out recently by Timo- 
thy Dunne et al. (1988 p. 45), “Despite wide 
recognition of the important role of entry 
and exit, surprisingly little is known about 
the patterns of firm entry and exit in the US 
economy.” Nonetheless, the model pre- 
sented here does help to provide one expla- 
nation for dumping cases in which (i) price 
discrimination is neither alleged nor docu- 
mented, (ii) there is a surge in total demand 
for the dumped product, (iii) there is a 
surge in total production of the dumped 
product in the dumping country, and 
(iv) the dumping country is among the group 
of newly industrializing countries whose 
share of world manufacturing trade has risen 
so damatically during the past ten years. 


VI. Concluding Remarks 


This paper has developed a model of 
dumping that is similar in spirit to Jovanovic 
(1982), but in which entry and exit and the 
terms of trade are endogenous. The novel 
feature of the two-country Ricardian model 
developed in this paper is that dumping can 
arise in the free-trade equilibrium even if 
the two countries differ only in their initial 
stock of technological knowledge. 
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Intuitively, dumping can arise because the 
present value of entry includes the value to 
the individual producer of learning his pro- 
ductivity parameter. Such information is 
valuable because the most productive pro- 
ducers earn positive rents in equilibrium. 
Competition among ex ante identical pro- 
ducers for these present and future rents 
can drive down the price of the product 
sufficiently so that the industry suffers losses. 
Ex post inefficient producers, upon learning 
their productivity parameter, recognize their 
losses and exit the sector. The rents earned 
by the most productive producers offset the 
opportunity cost previously incurred. 

The reason why entry to the z sector is 
costly for a new entrant is that, for the 
length of a “production period,” that en- 
trant is unable to produce and consume w 
units of y. In this sense, factor specificity is 
crucial for this, as well as other, explana- 
tions of dumping. However, a specific-fac- 
tors explanation that excludes a role for 
learning via production would not appear to 
predict that dumping should be observed in 
periods of high demand for the dumped 
product. Specific-factor models of the sort 
developed by Ethier (1982) illustrate why 
dumping can occur when demand is transi- 
torily low, not permanently high. 

In my analysis, I have said that a firm 
dumps if it exports at a price that fails to 
cover the opportunity cost w. Of course, 
with the simple Ricardian-entrepreneur 
setup of this paper, there is no explicit wage 
bill paid by a firm. Rather, there is a return 
that must be earned if the Ricardian en- 
trepreneur is to remain in the z sector. 
Given the Commerce Department’s method 
of constructed value, the use of the static 
opportunity cost of resources in this world 
(w) to define dumping does not seem to be 
at variance with the definition that is actu- 
ally employed by the Commerce Depart- 
ment in dumping cases. 

In my analysis, I have abstracted from the 
durability of dumped goods. If dumped 
goods can be purchased and stored, it is 
natural to ask why they are sold when the 
price is expected to increase, because of the 
mechanism outlined here, because of the 
mechanism outlined in Ethier (1982), or be- 


MARCH 1993 


cause of the expectation that the Commerce 
Department will impose dumping duties on 
the culprits. Does this not violate the inter- 
temporal arbitrage condition? In the ab- 
sence of transaction costs, it of course does. 
In fact, many if not most of the goods that 
are said by the Commerce Department to 
be dumped are durable and storable. The 
fact that I, along with others, ignore this 
question does not suggest that it is not 
important. 

In the Introduction, I alluded to the liter- 
ature on cyclical dumping. Two recent pa- 
pers, by Gruenspecht (1988) and Andrew 
Dick (1988) study the connections between 
internal scale economies (along a learning 
curve in the sense of Michael Spence [1981]), 
an exogenous duopoly market structure, and 
the resulting Nash equilibrium. The basic 
message of both papers is that, since the 
relevant marginal cost of an extra unit of 
output includes the reduction in future costs 
that will be forthcoming as a firm moves 
down the learning curve, observed unit cost 
can exceed the shadow marginal cost and, 
depending upon the elasticity of demand, 
the current price of output. Thus, dumping 
can result because measured unit cost over- 
states true marginal cost. 

Clearly, the common definition of dump- 
ing embodies several related but distinct 
phenomena. The purpose of this paper has 
most surely not been to argue that all 
dumping is the result of the process of entry 
and shakeout that has been outlined above. 
Rather, the modest goal has been to show 
one mechanism by which transitory differ- 
ences in the stock of technological knowl- 
edge in otherwise identical countries can 
result in international trade and, under cer- 
tain conditions, dumping, and to evaluate 
some of the implications. 


APPENDIX 


PROOF OF PROPOSITION 5: 

From Corollary 2, when p’<w/(1-— 9), 
e <P. The minimum supply of z in period 2 
at a price p’ > w is given by 


N[1-7(1-@)]. 
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This is because at most 7/N firms will be 
unwilling to produce at p’ >w. These are 
the (at most) wN firms that entered in 
period 1 and found their productivity pa- 
rameter to be |— g. Since total output of z 
is NEE =1, N[1— ¢)] is the minimum 
period-2 output of z at p’ >w. Thus, from 
(13), — ¢) < — ¢)] is the re- 
striction that insures that p’<w/(1— 9). 
Equation (17) and the second term in the 
max{[-] in (18) follow immediately. For p < 
w, demand at a price w must fall short of 
the elastic supply of z, N, when p equals or 
exceeds the entry price e<w. This will be 
true when aMw/w <N. The first term in 
the max[-] in (18) follows. By similar rea- 
soning, aM < N[1—7(1— ¢)] is the condi- 
tion that, along with the condition g <+ 
(Corollary 2), p’=w. When the period-2 
price is pushed to the break-even price w, 
the period-1 ex ante entry price e is, from 
equation (14), equal to — wo). 


PROOF OF PROPOSITION 7: 
The proof is similar to that of Proposition 
5. From Corollary 2, when p’<w/(- 9), 


e<B. The minimum world supply of z in 
period 2 at a price p’>w is given by 
2N[1—7(1—)]. This is because at most 
aN firms in each country will be unwilling 
to produce at p’>w. These are the (at 
most) zN country-2 firms that entered in 
period 1 and found their productivity pa- 
rameter to be 1— ¢, plus the 7A inefficient 
country-1 firms. Since total output of z is 
2NEE =2N, 2N[1—7(1— ¢)] is the mini- 
mum period-2 output of z at p’ > w. Thus, 
from (13), ¢)<2N[1— 7 
is the restriction that insures that p’ < 
w/(1—@). The second term in max{-] in 
(23) follows immediately. For the first-period 
equilibrium price to be driven below the 
break-even price, p < w, world demand at a 
price w must fall short of the elastic supply 
of z when p equals or exceeds the entry 
price e<w. This supply is given by N+ 
It is the sum of elastic country-2 
supply at the entry price e and the supply of 
the most efficient producers in country 1. 
When aMw/w< N+ @)N, 
p<w if p’<w/(U-—g). The first term in 
max{[-:] in (23) follows. Finally, for dumping 
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to occur in period 1, it must be the case that 
d(p)>2m7U+@)N; otherwise country 1 
will export z to country 2. Since p<w by 
the first inequality, this condition will be 
satisfied if 2aMw/w>27(1+@)N; the 
second inequality follows. The set of N that 
satisfies (24) is nonempty for all 7 <5, 9 <1, 
and positive a, since [1— ¢)]> 
and 1+ 7(1+ g)>27(1+ g). 
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Union Organizing Activity, Firm Growth, and the 
Business Cycle 


By STEPHEN G. BRONARS AND DONALD R. DEERE* 


This paper analyzes the relationship between changes in unionization and firm 
growth. Average growth is significantly lower in manufacturing firms that 
experience successful union elections, but these strong “effects” are largely 
illusory. We find no evidence of a significant relationship between unionization 
and firm growth, despite a strong cyclical pattern in election activity. Our results 
suggest that the significant negative effect of organizing activity on a firm’s 
market value is not accompanied by any growth changes. We therefore cannot 
reject the hypothesis that the equity losses from union election activity represent 
a simple transfer of wealth from shareholders to workers. (JEL J50) 


A large body of empirical evidence indi- 
cates that unionization is negatively related 
to firm profitability (see e.g., Kim B. Clark, 
1984; Michael A. Salinger, 1984; Thomas 
Karier, 1985; Robert A. Connolly et al., 
1986; Paula B. Voos and Lawrence R. 
Mishel, 1986; Nancy L. Rose, 1987; John M. 
Abowd, 1989a; Barry T. Hirsch, 1991).' In 
particular, increases in union organizing ac- 
tivity at a firm result in significant losses 
for shareholders. Richard S. Ruback and 
Martin B. Zimmerman (1984) and Bronars 
and Deere (1990) find that the market value 
of the firm declines by 3-4 percent follow- 
ing a successful union representation elec- 
tion and by 1-2 percent after an unsuc- 


*Bronars: Department of Economics, University of 
Texas, Austin, TX 78712; Deere: Department of Eco- 
nomics, Texas A&M University, College Station, TX 
77843. A portion of the work on this paper was com- 
pleted while Deere was on leave at UC-Santa Barbara. 
This material is based in part upon work supported by 
the Texas Advanced Research Program under grant 
number 14581 to Deere. We thank George Borjas, 
Melissa Famulari, Jeff Grogger, Brooks Pierce, Robert 
Reed, Doug Steigerwald, Larry Summers, Joe Tracy, 
Steve Trejo, and an anonymous referee for helpful 
comments. We are grateful to Richard Ruback and 
Martin Zimmerman for generously providing us with 
their data. Peter Griffin provided excellent research 
assistance. Any remaining errors are ours. 

'In isolated cases higher union productivity may 
offset the union wage gains (see e.g., Charles Brown 
and James L. Medoff, 1978; Kim B. Clark, 1980; Steven 
G. Allen, 1984, 1987). 
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cessful election. These estimates surely 
understate the impact of unionization on 
profitability because stock prices react only 
to new information. Given the sizable effect 
of union organizing activity on the market 
values of firms, it is important also to exam- 
ine the relationship between union repre- 
sentation elections and firms’ employment, 
investment, and output decisions. 

If firms alter their inputs or output in 
response to a union election, then a portion 
of the associated decline in equity repre- 
sents a loss to shareholders that the union 
does not capture. On the other hand, if 
unions extract wealth efficiently, a union 
election will not cause changes in firms’ 
behavior but will merely transfer income to 
union members. 

Unions achieve wage gains by appropriat- 
ing rents or quasi-rents that would other- 
wise accrue to shareholders. Increases in 
employment, capital investment, or output 
generate such rents and may thus increase 
union organizing activity. Alternatively, 


“Analyzing firms’ behavior around union elections 
thus provides a useful method for gauging the effi- 
ciency of new unionization. There have been several 
recent analyses of the efficiency of union contracting 
between a firm and an existing union, with mixed 
results (see Clark, 1984; James N. Brown and Orley C. 
Ashenfelter, 1986; David Card, 1986; Randall W. 
Eberts and Joe A. Stone, 1986; Thomas E. MaCurdy 
and John H. Pencavel, 1986; Abowd, 1989a). 
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growing firms may be less susceptible to 
unionization because of an increased ability 
to maintain worker morale. A careful analy- 
sis of the changes in inputs and output and 
the occurrence of union elections can help 
to clarify the causal relationship between 
firm behavior and union organizing activity. 

This paper examines how union represen- 
tation elections and employment, invest- 
ment, and output decisions are related us- 
ing a 20-year panel of data on large publicly 
traded firms. Average growth rates of inputs 
and output are indeed significantly lower in 
firms that experienced successful union 
elections in the manufacturing sector, but 
these apparently strong “effects” of union- 
ization are largely illusory. 

In our search for causal linkages, we per- 
form vector autoregressions of union elec- 
tions and firm growth. Statistical tests for 
Granger-causality provide some evidence of 
causation running from union elections to 
firm growth, but no indication of causation 
from growth to elections. The direction of 
the effects on growth differ across manufac- 
turing and nonmanufacturing firms, how- 
ever, and the strongest effects on growth are 
associated with elections that the union 
loses, while there is essentially no change in 
growth after union wins. We do find that 
representation elections tend to occur near 
peaks in aggregate economic activity, mak- 
ing it somewhat difficult to isolate the ef- 
fects of these elections from cyclical factors. 

Taken together these results provide the 
key finding of the paper: that while union 
election activity has a strong cyclical compo- 
nent, it appears to be largely unrelated to 
changes in the firm’s inputs or output. Thus, 
the main implication of the analysis is that 
the negative equity effect of union elections 
is not accompanied by any impact on firm 
growth. 

The next section describes the data set 
that allows us to test for causality between 
union organizing activity and a firm’s input 
and output decisions and briefly summa- 
rizes the important empirical regularities in 
union election activity and firm growth. This 
section documents the significant, negative 
cross-section relationship between elections 
and growth and highlights the strong cycli- 
cal pattern in election activity. Section II 


MARCH 1993 


presents the results of the vector autore- 
gressions and causality tests. The paper 
concludes with a discussion of our findings 
and their implications. 


I. The Data 


The events that we use to signify changes 
in union status at a firm are similar to those 
used by Ruback and Zimmerman (1984). 
These data are based on union representa- 
tion elections filed with the National Labor 
Relations Board (NLRB). The first step in 
the unionization process is to petition the 
NLRB to hold an election. A petition is 
successful if it contains signatures of at least 
30 percent of the employees to be covered 
by the new union. Successful petitions con- 
firm that an election will take place, usually 
within several months of the filing of the 
petition. Election results are certified by the 
NLRB, typically within five days of the ac- 
tual -lection. Following Ruback and Zim- 
merman (1984), we initially limit our sample 
to those elections involving 750 or more 
workers in NYSE firms between February 
1962 and December 1984. 

We match these data on union represen- 
tation elections to observations on firms’ 
behavior obtained from Standard and Poor’s 
COMPUSTAT data. These annual data, 
covering the period 1965-1984, are derived 
from the financial statements, 10-K reports, 
and other SEC filings of publicly traded 
firms and represent year-end values. We 
focus on the time-series behavior of a firm’s 
employment, investment in plant and equip- 
ment, and sales revenue.* We deleted firms 
from our sample if they did not report em- 
ployment, investment, or sales for any of the 
years from 1965 to 1984. 

There are a total of 85 firms for which we 
can match union election data (involving at 
least 750 workers) and a complete 20-year 
panel of employment, investment, and sales 
data from COMPUSTAT. The merging of 
these two data sets results in a sample of 
229 elections. Table 1 presents descriptive 


Sales and investment are measured in millions of 
1972 dollars, and employment is measured in thou- 
sands of workers. These values represent year-end 
totals /levels. 
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TABLE 1—UNION REPRESENTATION ELECTIONS 1962-1984 INVOLVING 
At Least 750 WorRKERS 


A. Distribution of Elections Across Firms 


Elections Number Percentage Union wins Number Percentage 
per firm of firms of firms per firm of firms of firms 


44 ! 57.65 
14 : | 27 31.76 

2 5.88 
2.35 
2.35 


18 


100.00 100.00 


B. Distribution of Elections Through Time 


Number Number 
Year of elections of wins 


1962 9 
1963 
1964 
1965 
1966 
1967 
1968 
1969 
1970 
1971 
1972 
1973 
1974 
1975 
1976 
1977 
1978 
1979 
1980 
1981 
1982 
1983 
1984 


2 
2 
2 
1 
2 
4 
3 
4 
2 
2 
4 
1 


worm 


tN 


Total: 229 


C. Size Distribution of Elections 


All elections Successful elections 


Number of Percentage of Number of Percentage of 
Percentile workers involved workers involved workers involved workers involved 


800 0.2 750 
1,035 995 
1,210 2.5 1,314 
1,717 5.2 1,729 
8,408 : 5,329 


51 
1,538 
864 


205 
4 
6 2.35 
7 2.35 
8 2.35 
9 3.53 
23 
Total: 85 
0 
) 
| 
0 
| 
51 
1 0.1 
25 0.7 
50 2.6 
75 5.2 
100 74.6 
N: 229 200 45 
Mean: 1,543 5.2 5.3 
SD: 965 9.0 11.3 


206 


THE AMERICAN ECONOMIC REVIEW 


MARCH 1993 


TABLE 2—SUMMARY STATISTICS FOR ELECTION AND NONELECTION FIRMS, 


1965-1984 
Firms where Firms where 
the union the union 
Variable won an election did not win Nonelection firms 
Number of firms 36 49 72 
Number of elections 128 101 0 
Number of wins 51 0 0 
Percentage of firms in 69 90 75 
manufacturing 
Sample means (standard deviations): 
Sales in $ millions 3,563.5 1,547.0 2,951.7 
(6,240.9) (1,661.9) (5,258.0) 
Investment in $ millions 261.7 107.1 246.1 
(492.5) (145.4) (510.0) 
Employment in 1,000’s 98.1 42.4 56.4 
(151.6) (39.2) (57.7) 
Log sales 7.38 6.76 7.20 
(1.20) (1.22) (1.28) 
Log investment 4.66 3.89 4.36 
(1.31) (1.41) (1.60) 
Log employment 3.84 321 3.48 
(1.22) (1.18) (1.19) 


statistics for the sample of union elections, 
including information on the distribution of 
elections across firms and through time and 
on the amount of employment involved in 
these elections. The table shows that the 
fraction of employment involved in these 
elections is skewed, though the distributions 
are very similar for all elections and for 
elections that the union wins. 

In order to obtain a control group with 
which to compare our sample of union-elec- 
tion firms, we selected firms traded on the 
New York Stock Exchange that did not ex- 
perience a union representation election in- 
volving 750 or more workers in the sample 
period. We again deleted all firms for which 
there were missing data concerning employ- 
ment, investment, or sales in the COMPUS- 
TAT data set. To limit the impact of dif- 
ferences in firm size across election and 
control groups a stratified sampling proce- 
dure was used. We first calculated the sam- 
ple quartiles of average employment over 
the period 1965-1984 for the 85 election 
firms we observed. Using these sample 
quartiles we divided the nonelection firms 


into four mutually exclusive groups based 
on their average employment during 1965- 
1984. We then obtained the largest possible 
random sample of nonelection firms given 
this stratification.4 The resulting control 
group includes 72 nonelection firms. 

Table 2 presents summary statistics for 
the election and nonelection subsamples. 
The election subsample is divided into those 
firms that experienced at least one success- 
ful election and those firms that did not. 
The industry composition of the nonelec- 
tion and winning-election firms is nearly 
identical, though losing-election firms are 


“The sample size of nonelection firms was based on 
the election-firm quartile that contained the smallest 
number of firms: there were 18 nonelection firms with 
average employment in the top quartile. After includ- 
ing all of these firms in our nonelection sample, we 
randomly sampled firms from the other quartiles to 
yield equal numbers of firms in each employment size 
group. It is noteworthy that the employment size distri- 
bution of potential control firms (i.e., no election, at 
least 750 workers, all 20 years of data) is very skewed 
compared to the distribution of election firms: there 
are only 18 firms in the top quartile and 408 in the 
bottom quartile. 
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TABLE 3—AVERAGE ANNUAL GROWTH RATES BY INDUSTRY FOR 
ELECTION AND NONELECTION Firms, 1965-1984 (PERCENTAGES; 
STANDARD DEVIATIONS IN PARENTHESES ) 


Firms with a 
union win 


Variable 


Firms without a Nonelection 
union win firms 


Manufacturing: 


Growth in sales 
Growth in investment 


Growth in employment 


Nonmanufacturing: 
Growth in sales 4.96 
(4.39) 


Growth in investment 5.80 
(5.92) 


Growth in employment 3.42 
(4.21) 


4.07 5.78 
(3.87) (3.11) 


5.58 5.44 
(4.98) (4.82) 


3.04 4.27 
(5.21) (3.44) 


“Difference from the growth rate for nonelection firms is significantly different 


from zero at the 0.05 level. 


Difference from the growth rate for firms without a union win is significantly 


different from zero at the 0.05 level. 


“Difference from the growth rate for firms with a union win is significantly different 


from zero at the 0.05 level. 


more likely to be in manufacturing. The 
average winning-election firm is larger than 
the average nonelection firm, which is larger 
than the average losing-election firm. Thus 
firms in the control group are roughly simi- 
lar to the election firms in terms of industry 
composition and size. 

Table 3 highlights cross-section differ- 
ences in the growth rates of employment, 
sales, and investment between winning-elec- 
tion, losing-election, and nonelection firms 
over the sample period. Within manufactur- 
ing, average growth rates in employment, 
investment, and sales are about 28-33 per- 
cent smaller in losing-election firms than in 
the control group of nonelection firms. The 
growth-rate differences between winning- 
election and nonelection manufacturing 
firms are even more pronounced. Employ- 
ment, investment, and sales growth rates 
are only about one-third to one-half as large 
in winning-election firms as they are in the 


control group of nonelection firms in the 
manufacturing sector.> 


°These results continue to hold if average growth 
rates are regressed on the size of the firm in 1965 and 
dummy variables for winning-election status, losing- 
election status, and durable manufacturing. Though 
firm size is an important (negative) determinant of 
growth and larger firms are more likely to experience a 
union win, winning-election firms still experienced sig- 
nificantly less growth over this period when we con- 
trolled for initial firm size. 

Three recent papers also report a negative cross- 
section relationship between unionization and employ- 
ment. David G. Blanchflower et al. (1991) analyze data 
on British establishments and conclude that union sta- 
tus is negatively related to employment growth, though 
Stephen J. Machin and Sushil B. Wadhwani (1991) 
dispute this. Jonathan S. Leonard (1992) finds that 
employment growth is lower by about 4 percentage 
points in union plants than in nonunion plants. Edward 
B. Montgomery (1989) uses CPS data and finds a very 
weak relationship between employment status and 
unionization. 


| 207 
2.64? 3.77 4.82° 
(2.64) (3.28) (3.13) 
1.79% 4.04° 5.19° 
(3.87) (4.55) (5.38) 
0.81" 2.21 2.94° 
(2.86) (3.35) (3.55) 
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TABLE 4—AVERAGE GNP GRowTH AND AVERAGE ELECTION-FIRM SALES GROWTH 
AROUND UNION ELECTIONS (PERCENTAGES; STANDARD ERRORS 
IN PARENTHESES ) 


GNP growth 


If elections were 


Average sales 


Observed — uniformly distributed growth at 

Time elections over time Difference — election firms 

Two years before Rho | 2.83 0.74** 4.35 
an election (0.18) 

One year before 3.52 2.80 Ociz"* 4.01 
an election (0.18) 

Year of an 2.75 2.85 —0.10 4.14 
election (0.18) 

One year after 2.10 2.74 —().64** 1.65 
an election (0.18) 

Two years after 2.14 2.60 —(0.46** 2.76 
an election (0.17) 


Note: The F test that the first two columns are equal can be rejected at the 0.01 level. 


See the text for further details. 


**Significantly different from zero at the 0.01 level. 


This negative cross-section relationship 
between average growth rates and union 
elections is not apparent in nonmanufactur- 
ing firms. In general the differences in 
growth rates across nonelection, losing-elec- 
tion, and winning-election firms are rela- 
tively minor and are always statistically in- 
significant. 

Two interesting features of the time-series 
behavior of union organizing activity are 
apparent from Table 1. First, the probabil- 
ity of observing a union election decreases 
sharply over time. Second, representation- 
election propensities appear to be highest 
near the peak of the business cycle, 1-2 
years before major cyclical contractions. 
Table 4 provides more direct evidence of 
this cyclical pattern. For each election in 
our sample, we calculate annual GNP 
growth for the year of the election and for 
the two years immediately before and im- 
mediately after the election. 

The first column of Table 4 presents aver- 
ages of these five GNP growth rates across 
the 200 elections that occurred during 
1965-1984. GNP growth was noticeably 
higher on average in the years just before 
an election than in the years just after an 
election. For comparison, the average GNP 


growth rates that would arise if these elec- 
tions were distributed uniformly over 
1965-1984 are presented in the second col- 
umn of Table 4. The third column reports 
the difference between the first two columns 
with associated standard errors in order to 
test the null hypothesis that these large 
union elections are uncorrelated with the 
business cycle.° The results indicate that 
GNP growth was significantly above average 
before elections and was significantly below 
average following elections.’ 


°Given that elections became increasingly rare over 
time, we also compared the observed GNP growth 
rates around elections to those that would arise if 
elections were distributed along a simple negative 
trend. Again we could reject this hypothesis in favor of 
a procyclical pattern. 

Orley C. Ashenfelter and John H. Pencavel (1969) 
argue that organizing is generally procyclical. As em- 
ployment expands, the pool of unorganized workers 
increases, and representation elections are more likely 
to occur. In addition, union organizing drives may be 
more likely during expansions because there is both 
more to be won (i.e., profits are higher) and less to be 
lost (i.e., labor markets are tighter). Interestingly, the 
results here and in Tables 5 and 6 suggest that aggre- 
gate growth has a much stronger effect on organizing 
activity at a firm than does the firm’s own growth. 
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The fourth column of Table 4 presents 
average sales growth in the election firms 
for the corresponding years surrounding an 
election.* Similar to the pattern of growth 
in aggregate output, growth in the firms that 
experience elections is higher before and 
lower after an election. Thus, observed dif- 
ferences in firm behavior before and after 
union elections may largely reflect effects of 
the business cycle, rather than the impact of 
election activity. 

The simple cross-section and time-series 
relationships depicted in Tables 3 and 4 
imply that: (i) average growth rates between 
1965 and 1984 were significantly negatively 
related to union election success, at least in 
the manufacturing sector; and (ii) union 
election propensities were the highest at the 
peak of cyclical expansions. In the next sec- 
tion we develop a more formal empirical 
analysis with these facts in mind. 


II. Vector Autoregressions of Union 
Elections and Firm Growth 


The cross-section results described above 
may simply be due to unobserved hetero- 
geneity in growth rates that is correlated 
with the propensity for union elections. In 
this section, we develop and estimate a dy- 
namic model of union elections and firm 
growth that allows for firm-specific hetero- 
geneity in growth rates and election propen- 
sities. Given the strong cyclical pattern of 
election activity, we control for aggregate 
growth by allowing separate year effects as 
well as by including the nonelection firms in 
the regressions. We then perform hypothe- 
sis tests on these vector autoregressions 
(VAR’s) in an attempt to uncover whether 


“Average sales growth includes only those firm /years 
that were within two years of an election. This can be 
most easily calculated by regressing the 1,615 election- 
firm observations (19 years [one is lost in calculating 
growth rates] 85 firms) on a constant, an indicator 
variable for the occurrence of an election, and two 
leads and two lags of this indicator. The patterns of 
sales growth and GNP growth are quite similar to 
those reported if the sample is divided into manufac- 
turing-sector and nonmanufacturing-sector elections. 
There are also similar patterns of growth in employ- 
ment and capital investment in the election firms. 
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increases in firm growth influence union 
organizing activity and whether union elec- 
tions affect subsequent firm growth. 

Consider the following cross-section / 
time-series model of a firm’s input and out- 
put decisions and union organizing activity 
at the firm: 


M 
(1) = a;+0,+ 
m=1 


M 
=] 


m= 


where Y,, represents the annual percentage 
change (log difference) in firm i’s sales, 
employment, or investment in plant and 
equipment. The coefficients B,,, measure the 
impact of lagged changes in the growth of 
inputs or output on the current changes in 
inputs or output. The vector X;, contains 
two variables measuring the union election 
activity at firm 7 in period f: the fraction of 
the firm’s employment involved in unsuc- 
cessful union elections and the fraction in 
successful elections.? The coefficients y,, 
measure the impact of past union organiz- 
ing activity on the time-series pattern of 
sales, employment, or investment at the firm. 
The parameters a; allow for differences in 
the mean growth rates of sales, employ- 
ment, or investment across firms, while the 
6, allow for separate year effects. The lag 
length, equal to M, is sufficient to capture 
all of the impact of lagged Y and X on 
current Y; hence, the error term ¢;, is a 
zero-mean serially uncorrelated random 
variable. 


‘We also performed the analysis using simple 
dummy variables as the measure of union election 
activity. These results were similar to (though some- 
what weaker than) those reported. In Bronars and 
Deere (1990), we found that the fraction of firm em- 
ployment involved in the election is an important de- 
terminant of the equity loss associated with the elec- 
tion; hence, we prefer the use of the employment 
fractions. 
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The time-series behavior of X;, can be 
modeled in a similar fashion: 


(2) X,,=«,+a,+ 


m**it—m 


+ it—m +p 


m=1 


it 


where the coefficients 6,,, measure the im- 
pact of lagged union elections on current 
elections, and the coefficients 7, measure 
the impact of lagged sales, employment, or 
investment on union elections. The parame- 
ters x; and A, allow union election activity 
to differ across firms and over time, respec- 
tively. Again the lag length M ensures that 
the error term v;, is serially uncorrelated. 
Analysis of the estimated coefficient vec- 
tors, y and tw, provides the opportunity to 
test for Granger-causality between elections 
and growth. 

Equations (1) and (2) specify a complete 
dynamic relationship between elections and 
firm growth. The structure of this relation- 
ship may, however, differ across groups of 
firms. For example, as was evident in the 
preceding section, the relationship between 
union election activity and average firm 
growth appears to differ across major indus- 
try groups. In view of this, we estimate 
equations (1) and (2) separately for manu- 
facturing and nonmanufacturing firms.'” 


There are 69 election firms (with 180 elections) in 
manufacturing and 16 election firms (with 49 elections) 
in nonmanufacturing. There are 54 manufacturing and 
18 nonmanufacturing firms in the nonelection control 
group. 

We also estimated a variety of other specifications. 
Pooling all firms across industries gave no evidence of 
any causal relationships. If those firms that experi- 
enced only a single election (35 in manufacturing and 
nine in nonmanufacturing) were included along with 
the nonelection firms, the results are similar to and 
slightly stronger than those reported. The variation in 
GNP growth around these single elections is also more 
pronounced than that reported in Table 4, however. 
We estimated VAR’s that included lagged sales, invest- 
ment, and employment growth together as regressors. 
These results were in close agreement with those re- 
ported. Lastly, because the vast majority of elections 
involve fewer than 750 workers, we also examined the 
complete union-election histories for the firms in our 
sampie over the period 1977-1984 (the NLRB data 
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It is important to note that ordinary 
least-squares (OLS) estimation of (1) and 
(2) may lead to biased coefficient estimates 
because of the presence of lagged endoge- 
nous variables (Stephen J. Nickell, 1981). 
Given that our panel is rather long, such 
biases may be small. An alternative esti- 
mation strategy, which eliminates this bias 
via a generalized two-stage least-squares 
(2SLS-GLS) procedure, is suggested by 
Douglas Holtz-Eakin et al. (1988). We per- 
formed both estimation procedures. A com- 
parison of the OLS and 2SLS-GLS results 
reveals that the OLS estimates of the own- 
lag effects do differ significantly, but pre- 
dictably, from the 2SLS-GLS estimates. 
Hence the Nickell (1981) bias is present and 
is roughly equal to —1/(T —1), where T = 
16 is the effective length of the panel after 
lagging. Despite these differences in own-lag 
effects, the estimates of the zross-effects are 
quite similar. The implicatiuns for causality 
are therefore the same using either estima- 
tion procedure. Given this comparison, we 
report the 2SLS-GLS results in Tables 5 
and 6. For completeness, the OLS results 
are given in the Appendix, along with a 
discussion of the data sources and method- 
ology. We turn now to a discussion of the 
results. 

The estimates of (1) reported in Table 5 
provide some evidence that union election 
activity causes the firm’s subsequent growth 
to change. Employment and sales growth 
decline following a losing election in manu- 
facturing. An election involving 3 percent of 
the firm’s workers decreases growth by about 
0.5—0.75 percentage points in the next three 
years. This effect is somewhat stronger and 
of greater statistical significance for sales 
growth. In contrast to the negative effect of 


prior to 1977 are very difficult if not impossible to 
match to COMPUSTAT data). These 157 firms experi- 
enced 1,541 elections during 1977-1984 of which 642 
were successful for the union. Of the 72 firms originally 
designated nonelection, 52 experienced an election in- 
volving fewer than 750 workers during 1977-1984. We 
estimated VAR’s similar to those reported in Tables 5 
and 6, and also measured the election effects with 
simple dummy variables. These additional estimates 
were generally supportive of our findings: the esti- 
mated effects varied across industries, and there was 
little evidence of union-win effects. 


M 
m=1 
M 
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TABLE 5—VeEcTOR AUTOREGRESSIONS (2SLS-GLS): GrowTH 


Dependent variable 


Growth in Growth in Growth in 
employment investment sales 


Explanatory variables Coefficient SE Coefficient SE Coefficient SE 


Firms in Manufacturing: 


Lag-1 union loss — 1.77 
Lag-2 union loss — 2.64 
Lag-3 union loss — 1.36 
Lag-1 union win — 2.54 
Lag-2 union win — ().84 
Lag-3 union win —().76 
Lag-1 employment 0.13 
Lag-2 employment 0.002 
Lag-3 employment — (0.04 
Lag-1 investment 

Lag-2 investment 

Lag-3 investment 

Lag-1 sales 

Lag-2 sales 

Lag-3 sales 


tout 
Mn 
We Wn un 


F tests (p values in parentheses): 


All union-loss 1.90 0.16 225 
coefficients (0.127) (0.921) (0.080) 
All union-win 0.08 0.19 
coefficients (0.969) (0.903) 
All election 1.01 0.18 
coefficients (0.419) (0.983) 


Firms in Nonmanufacturing: 


Lag-1 union loss 11 10.09 
Lag-2 union loss Xs 10.12 
Lag-3 union loss .28 9.99 
Lag-1 union win 95 21.91 
Lag-2 union win 9S 21.91 
Lag-3 union win : : 9.3 5.61 
Lag-1 employment 

Lag-2 employment 

Lag-3 employment 

Lag-1 investment 

Lag-2 investment 

Lag-3 investment 

Lag-1 sales 

Lag-2 sales 

Lag-3 sales 


F tests (p values in parentheses): 


All union-loss . 1.53 6.81 
coefficients 175 (0.206) (0.001) 

All union-win ; 1.37 0.30 
coefficients (0.252) (0.829) 

All election F 1.45 3.58 
coefficients (0.319) (0.195) (0.002) 


Note: Each regression also included a complete set of year dummy variables. The 
coefficients on the union variables have been multiplied by 10. For firms in manufac- 
turing, N = 1,845; for firms in nonmanufacturing, N = 510. 
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2.85 4.39 —2.47 1.20 
53 4.40 — 1.78 1.20 
).31 4.39 — 2.06 1.72 
3.25 20.46 — 3.32 5.60 
5.76 13.03 — 1.49 3.57 
65 12.38 1.28 3.39 
(0.08 = 
0.02 = = = 
0.02 = 
= 9 0.02 = = 
= ).23 0.02 — — 
— 12 0.02 — 
= 0.21 0.03 
= = —0.02 0.02 
Es = — 0.04 0.02 
9.23 2.79 
9.72 2.82 
5.33 2.80 
—2.13 6.03 
2.48 6.03 
1.11 1.50 
0.17 0.05 
— (0.02 0.05 
0.20 0.05 
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TABLE 6— VECTOR AUTOREGRESSIONS (2SLS-GLS): ELECTIONS 


(STANDARD ERRORS IN PARENTHESES) 


Dependent variables 


Elections lost by the union 


Elections won by the union 


Explanatory variables (i) (ii) (iii) (i) (ii) (iii) 
Firms in Manufacturing: 
Lag-1 union loss —3.35 — 3.28 — 3.33 0.16 0.19 0.22 
(2.48) (2.49) (2.49) (0.53) (0.53) (0.53) 
Lag-2 union loss — 2.32 — 2.25 — 2.37 0.51 0.56 0.51 
(2.49) (2.49) (2.49) (0.53) (0.53) (0.53) 
Lag-3 union loss —3.32 — 3.46 —3.19 — 0.06 — 0.04 — 0.02 
(2.49) (2.49) (2.49) (0.53) (0.53) (0.53) 
Lag-1 union win — 6.55 — 6.12 — 6.10 — 1.40 — 1.44 — 1.37 
(11.60) (11.59) (11.59) (2.45) (2.45) (2.45) 
Lag-2 union win 1.93 2.00 1.71 — 0.79 — 0.69 — 0.86 
(7.38) (7.38) (7.38) (1.56) (1.56) (1.56) 
Lag-3 union win — 1.64 — 2.02 — 1.56 — 1.48 — 1.53 —1.51 
(7.02) (7.02) (7.01) (1.48) (1.48) (1.48) 
Lag-1 employment — 0.33 — — 0.83 
(5.32) (1.12) 
Lag-2 employment — 3.31 0.14 — 
(4.89) (1.03) 
Lag-3 employment — 0.38 — — — 1.02 — — 
(4.69) (1.99) 
Lag-1 investment 0.62 — — 0.23 
(1.41) (0.30) 
Lag-2 investment 0.03 0.27 
(1.41) (0.30) 
Lag-3 investment — 1.90 — —0.10 
(1.41) (0.30) 
Lag-1 sales 0.17 —2.11 
(5.71) (1.21) 
Lag-2 sales 3.92 0.66 
(5.07) (1.07) 
Lag-3 sales —6.13 — 1.85 
(4.96) (1.05) 
F tests (p values in parentheses): 
All employment 0.16 — — 0.49 — — 
coefficients (0.926) (0.689) 
All investment 0.87 — 0.56 — 
coefficients (0.459) (0.643) 
All sales coefficients 0.72 — — 1.90 
(0.543) (0.128) 
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TABLE 6— Continued 


Dependent variables 


Elections lost by the union 


Elections won by the union 


Explanatory variables (i) (ii) (iii) (i) (ii) (iii) 
Firms in Nonmanufacturing: 
Lag-1 union loss 1.61 1.50 1.33 0.19 0.25 0.33 
(4.87) (4.81) (4.83) (0.68) (0.68) (0.68) 
Lag-2 union loss 3.08 yA 2.31 0.33 0.26 0.33 
(4.86) (4.86) (4.88) (0.68) (0.68) (0.69) 
Lag-3 union loss 3.63 Wa 2.86 —().22 — 0.20 —0.22 
(4.79) (4.76) (4.85) (0.67) (0.67) (0.68) 
Lag-1 union win — 1.84 — 2.07 1.80 —0.19 — 0.25 — 0.22 
(10.45) (10.44) =(10.44) (1.47) (1.47) (1.47) 
Lag-2 union win 3.25 3.60 3.05 — 0.50 — 0.56 — 0.47 
(10.42) (10.45) (10.44) (1.46) (1.47) (1.47) 
Lag-3 union win 0.39 0.75 0.55 0.01 — 0.08 —().03 
(2.55) (2.67) (2.60) (0.36) (0.38) (0.37) 
Lag-1 employment — 7.92 1.82 
(10.79) (1.51) 
Lag-2 employment — 14.30 — 1.18 a 
(9.58) (1.34) 
Lag-3 employment 1.80 — 0.60 — — 
(8.93) (1.25) 
Lag-1 investment —2.14 0.42 
(2.40) (0.34) 
Lag-2 investment —0.11 0.15 
(2.46) (0.35) 
Lag-3 investment —1.18 —- 0.08 — 
(2.32) (0.33) 
Lag-1 sales 1.66 —0.12 
(9.42) (1.32) 
Lag-2 sales 4 — (0.14 
(8.90) (1.25) 
Lag-3 sales — 11.66 — _ — 0.16 
(8.80) (1.24) 
F tests (p values in parentheses): 
All employment 0.87 — 1.01 — — 
coefficients (0.455) (0.386) 
All investment 0.37 — 0.52 
coefficients (0.773) (0.671) 
All sales coefficients — —_ 0.68 — — 0.01 
(0.565) (0.999) 


Note: Each regression also included a complete set of year dummy variables. The 
coefficients on the election variables and the growth variables have been multiplied by 
100 and 1,000, respectively. For firms in manufacturing, N = 1,845; for firms in 
nonmanufacturing, N = 510. 
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elections in manufacturing, growth in- 
creases after a losing election in nonmanu- 
facturing firms. Growth is 1-5 percentage 
points higher in the three years following a 
losing election involving 3 percent of the 
firm’s employees. The effect on sales growth 
is quite statistically significant, whereas the 
effects on employment and investment 
growth are marginally significant at best. 
Despite these results for losing elections, 
however, there is virtually no evidence that 
successful union elections have any effect 
on firm growth in either industry.'! 

The formal statistical tests for causality 
entail examining whether changes in union 
election activity help predict firm growth 
given past growth. The joint F tests that all 
three lags of the election variables equal 
zero are also presented in Table 4. Non- 
causality can be rejected in only two cases: 
the effect of losing elections on sales growth 
in each industry. There is little evidence of 
any other causal effects and no indication 
that union wins have any significant impact 
on firm growth. Although these regressions 
provide some statistical evidence that union 
elections help predict firm behavior, the lack 
of any union-win effect, coupled with the 
quite different effects for union losses across 
industries, renders suspect the conclusion 
that elections truly cause growth changes. 

The evidence that firm growth causes 
election activity is even less compelling. The 
estimates of (2) in Table 6 provide scant 
evidence that changes in growth are fol- 
lowed by any measurable change in election 
activity. In fact, the only significant coeffi- 
cients (at the 0.10 level) on a growth vari- 
able in all of Table 6 indicate that a shock 
to sales growth has a slight positive effect on 
successful election activity at manufacturing 
firms.'2 The remainder of the election 


"'The lone exception is a weak negative effect of 
winning elections on investment growth three years 
later in nonmanufacturing firms. 

“The mean number of successful elections per 
firm/year is about 0.026, while the typical (median) 
fraction of employment involved in successful elections 
is 0.026. Multiplying these gives an estimated probabil- 
ity of 0.00068 that a typical worker is involved in a 
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VAR’s give no indication that innovations 
in a firm’s growth have any impact on union 
election activity at that firm. The F tests for 
causality imply that the hypothesis of non- 
causality from growth to elections cannot be 
rejected in any of the specifications. 

The general implication of the VAR’s is 
that there is only a weak relationship be- 
tween union organizing activity and firm 
growth. In addition, the only rejections of 
noncausality imply that losing elections have 
opposite effects on growth in manufacturing 
and nonmanufacturing firms. Most impor- 
tantly, there is no evidence that union suc- 
cess affects growth. 

At this point it is instructive to consider 
why we find so little evidence of a causal 
link between union elections and firm 
growth. The most obvious candidate is the 
confounding effect of the business cycle. If 
firms react differently to cyclical shocks, 
then, even though we include controls for 
aggregate growth, the strong cyclical pattern 
of union elections makes it difficult to ob- 
tain precise estimates of the relationship 
between changes in unionization and 
changes in growth. In essence the econome- 
trician is attempting to infer whether a firm’s 
behavior would have been different over a 
certain range of the business cycle had it 
not experienced a union organizing drive. If 
there is sufficient heterogeneity in cyclical 
sensitivity across firms, more precise esti- 
mates of this effect can only be obtained by 
observing the same firm’s behavior over a 
repeated number of business cycles. Given 
the opposite signs of the effects on growth 
and the similar cyclical election patterns 
across industries, however, this differing- 
sensitivity hypothesis requires that manufac- 
turing firms be more sensitive and that non- 
manufacturing firms be less sensitive to the 


union success in a given year. The estimates in Table 6 
imply that a 3-percentage-point increase in annual 
sales growth, which is approximately the mean sales 
growth, would increase this probability to 0.00074, or 
by about one-tenth. Thus, this lone significant effect is 
quite small in magnitude. 
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business cycle when they experience a union 
election.'* 

The fact that there are relatively few suc- 
cessful elections in the sample may also 
make it difficult to obtain precise estimates 
of union-win effects in either industry.'* It 
is important to note, however, that Ruback 
and Zimmerman (1984) found much larger 
and stronger equity effects for union wins 
than for union losses in this sample. More- 
over, these estimated equity effects did not 
differ much across industries (Ruback and 
Zimmerman (1984 pp. 1154-5). 

Finally, it is possible that a firm may not 
respond much to the occurrence of a union 
election, whether a win or loss, because it 
has already adjusted its behavior given the 
probability of unionization that if faces. Re- 
cently, Abowd (1989b) and Bronars and 
Deere (1991) have provided some evidence 
that differences in firm behavior are indeed 
associated with differences in the likelihood 
of unionization. 


III. Conclusions 


This paper began by noting that if firms 
alter their behavior in response to union 
elections then the process of new unioniza- 
tion is inefficient. Such changes in behavior 
imply that the associated loss to sharehold- 
ers exceeds the gain to the union. Growing 
firms are more attractive targets of union- 


'3Richard B. Freeman and Morris M. Kleiner (1990) 
compare firm behavior before and after union elections 
using a sample of elections from the 1980’s and find 
modest declines in employment levels following a union 
election. They also include a set of control establish- 
ments and year dummy variables, but as with our 
results, it is not clear that this controls sufficiently for 
the influence of the business cycle. 

“In addition, the fact that we limit our analysis to 
firms for which the full 20-year panel of data are 
available may lessen the chances of finding election 
effects. If business failures follow a union success, then 
our estimates understate the impact of elections. Lastly, 
there may be important intrafirm effects of unioniza- 
tion that our firm-level data cannot detect. For exam- 
ple, the unionization of one plant in a multiestablish- 
ment firm may result in a significant shift of production 
activity to nonunion plants with little overali effect on 
the firm. If such were the case, however, it is not clear 
why there would be large and permanent effects on the 
firm’s equity value. 
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ization because of the greater rents and 
quasi-rents available to shareholders, but 
they may be less susceptible to unionization 
because of their ability to maintain higher 
wages and greater job security. Understand- 
ing of the causes and consequences of 
unionization can be enhanced by a careful 
econometric examination of the time-series 
relationship between union organizing activ- 
ity and changes in input and output growth 
at the firm level. 

Our analysis suggests that the likelihood 
of a union election is quite sensitive to 
aggregate economic activity, but essentially 
unrelated to a firm’s own recent changes in 
output or inputs. Knowledge of past sales, 
employment, and investment changes across 
firms are of little help in predicting when or 
where union elections will occur. Our analy- 
sis also shows that after controlling for gen- 
eral time-period effects, union election ac- 
tivity has a weak effect on firm behavior. 
Sales growth does change somewhat follow- 
ing a union election, but these estimated 
effects are only apparent when unions are 
unsuccessful in their attempt to organize, 
and they are opposite in sign for manufac- 
turing and nonmanufacturing firms. 

These results lead us to the cautious con- 
clusion that the observed changes in a firm’s 
growth that follow a union election have 
little to do with unionization itself. We do 
note, however, that there is a significant, 
negative cross-section relationship between 
successful elections and firm growth in our 
sample. It is possible that this relationship is 
causal but that the confounding effects of 
the business cycle prevent us from uncover- 
ing any significant effects of elections. From 
this we conclude that cross-section analyses 
of unionization and firm behavior should be 
viewed with some skepticism. 

Our results also suggest that the large 
equity losses uncovered by Ruback and 
Zimmerman (1984), for these same elec- 
tions, are unrelated to systematic changes in 
firms’ inputs or output. Ruback and Zim- 
merman (1984) found an average 4-percent 
equity loss during the month of a successful 
union election, and we find no evidence of 
effects on growth for these election events. 
We cannot reject the hypothesis that the 


equity losses for these elections represent a 
simple transfer of wealth from shareholders 
to workers. This finding for changes in 
unionization is consistent with the conclu- 
sion of Abowd (1989a) and Clark (1984) 
that negotiations between an existing union 
and firm lead to efficient wealth transfers. 


APPENDIX 


The data used in the analysis were ob- 
tained from three sources. Richard Rutack 
and Martin Zimmerman kindly provided us 
with the data used in their 1984 paper. 
These data included information on the firm 
in which the election was held, the petition 
and certification months, the number of 
workers involved in the election, and the 
outcome of the election. We supplemented 
these data by extending the time period 
through 1984 and by including every elec- 
tion that occurred at all 157 firms in our 
sample during the period 1977-1984. The 
supplemental election data were derived 
from the election data tapes maintained by 
the National Labor Relations Board and, 
on occasion, from the published Monthly 
Election Reports of the NLRB. The data on 
sales, employment, and investment as well 
as industry attachment were taken directly 
from the COMPUSTAT data tapes. All data 
used in the paper are available from the 
authors upon request. 

We give here a brief description of the 
2SLS-GLS estimation procedure used for 
Tables 5 and 6. Further details and the 
programs used are available from the au- 
thors upon request. We take first differ- 
ences of equations (1) and (2) to remove the 
firm-specific effects. Differencing induces 
serial correlation in the error terms, and it 
also creates a correlation between AY,,_, 
and Ae,, and between AX;,_, and Av,, due 
to the shared terms dated ¢ —1. The serial 
correlation of the errors was addressed by 
using a generalized least-squares (GLS) 
transformation to obtain a scalar covariance 
matrix, and the correlation between the er- 
rors and regressors was addressed by utiliz- 
ing an instrumental-variables procedure (on 
the transformed equations). As suggested by 
Holtz-Eakin et al. (1988), we utilize lags of 
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Y and X as instruments for the differenced 
and transformed variables. 

The GLS transformation required knowl- 
edge of the covariance matrices for Ae and 
Av. Initial consistent coefficient estimates 
were obtained from applying two-stage least 
squares to the differenced versions of (1) 
and (2) where year dummies and lags 2 
through M+1 of Y and X were used as 
the instrument set. The resulting residuals 
were then used to estimate the covariance 
matrices. The differenced variables were 
then transformed using the results of a 
Cholesky-type decomposition of the esti- 
mated covariance matrices to yield trans- 
formed equations with scalar covariance 
matrices. We assumed that the correlation 
of errors across firms is zero given the inclu- 
sion of time effects; hence, the covariance 
matrices are block-diagonal with identical 
blocks. 

The last step in obtaining the reported 
estimates was to apply two-stage least 
squares to the differenced and then trans- 
formed versions of (1) and (2), where the 
instrument set again comprised year dum- 
mies plus lags 2 through M +1 of Y and X. 
The appropriate value of the lag length M 
was obtained by performing F tests on suc- 
cessive lag lengths, beginning with M = 4. 
We selected M=3 as the final specifica- 
tion. 

Lastly, Tables Al and A2 report the OLS 
estimates of equations (1) and (2) for com- 
parison with Tables 5 and 6 in the text and 
for greater ease of replication. 
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Tas_eE Al—Vector AUTOREGRESSIONS (OLS): GRowTH 
Dependent variables 
Growth in Growth in Growth in 
employment investment sales 
Explanatory variables Coefficient SE Coefficient SE Coefhcient SE 
Firms in Manufacturing: 
Lag-1 union loss 1.61 1.17 3.80 4.13 —2.15 1.12 
Lag-2 union loss — 2.45 1.17 0.66 4.12 — 1.60 1.12 
Lag-3 union loss — 1.26 1.17 ).13 4.12 — 1.90 1.12 
Lag-1 union win — 3.78 5.65 2.90 19.89 —3.01 5.40 
Lag-2 union win —1.31 3.63 5.47 12.80 — 1.63 3.47 
Lag-3 union win — 1.17 3.48 0.08 12.27 1.08 3.33 
Lag-1 employment 0.05 0.02 
Lag-2 employment — 0.05 0.02 
Lag-3 employment —().09 0.02 
Lag-1 investment ).23 0.02 
Lag-2 investment — 0 0.02 
Lag-3 investment - 0.02 
Lag-1 sales — — 0.13 0.02 
Lag-2 sales _ - — —0.09 0.02 
Lag-3 sales - — - —0.10 0.02 
F tests (p values in parentheses): 
All union-loss 2.10 0.30 2.40 
coefficients (0.098) (0.824) (0.066) 
All union-win 0.21 0.19 0.22 
coefficients (0.890) (0.901) (0.901) 
All election 1.17 0.25 1.33 
coefficients (0.318) (0.961) (0.242) 
Firms in Nonmanufacturing: 
Lag-1 union loss 1.27 2.18 8.18 9.56 6.62 2.61 
Lag-2 union loss 1.54 2.16 5.55 9.54 7.88 2.61 
Lag-3 union loss 1.76 2.11 .8 9.32 4.68 2.56 
Lag-1 union win py 4.9] 2 21.70 —3.15 5.91 
Lag-2 union win 3.31 4.89 04 21.71 1.59 5.90 
Lag-3 union win — 1.26 1.20 10.01 5.56 0.34 1.46 
Lag-1 employment 0.07 0.04 — = = 
Lag-2 employment 0.03 0.04 
Lag-3 employment 0.01 0.04 = 
Lag-1 investment ) 0.05 — 
Lag-2 investment ).1 0.05 
Lag-3 investment 1.06 0.05 
Lag-1 sales — — — 0.11 0.05 
Lag-2 sales — — -9.11 0.04 
Lag-3 sales — — 0.07 0.05 
F tests (p values in parentheses): 
All union loss 0.46 1.42 bs 
coefficients (0.710) (0.237) (0.001) 
All union win 0.63 1.61 0.15 
coefficients (0.594) (0.186) (0.931) 
All election 0.55 1.53 2.86 
coefficients (0.769) (0.168) (0.010) 


Notes: Each regression also included a complete set of year and firm dummy variables. 
The coefficients on the union variables have been multiplied by 10. For firms in 
manufacturing, N = 1,968; for firms in nonmanufacturing, N = 544. 
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A2—Vector AUTOREGRESSIONS (OLS): ELECTIONS 
(STANDARD ERRORS IN PARENTHESES) 


Dependent variables 


Elections lost by the union Elections won by the union 
Explanatory variables (i) (ii) (iii) (i) (ii) (iii) 
Firms in Manufacturing: 
Lag-1 union loss — 11.64 — 11.59 —11.63 0.51 0.54 0.52 
(2.32) (2.32) (2.32) (0.49) (0.49) (0.49) 
Lag-2 union loss — 10.73 — 10.66 — 10.76 0.99 1.04 0.97 
(2.32) (2.32) (2.32) (0.49) (0.49) (0.49) 
Lag-3 union loss — 11.75 — 11.88 — 11.61 0.18 0.04 0.20 
(2.32) (2.32) (2.32) (0.49) (0.49) (0.49) 
Lag-1 union win —5.39 — 5.00 — 5.00 —7.92 —7.88 — 7.88 
(11.20) (11.20) (11.20) (2.38) (2.38) (2.38) 
Lag-2 union win 1.81 1.92 1.71 — 3.68 —3.56 — 3.69 
(7.21) (7.21) (7.20) (1.53) (1.53) (1.53) 
Lag-3 union win — 1.96 —2.28 — 1.81 — 4.00 — 4.06 — 3.99 
(6.91) (6.91) (6.91) (1.47) (1.47) (1.47) 
Lag-1 employment — 0.61 0.24 
(4.57) (0.97) 
Lag-2 employment —1.27 0.63 
(4.44) (0.94) 
Lag-3 employment 0.24 — 0.74 
(4.39) (0.93) 
Lag-1 investment 0.38 0.33 
(1.29) (0.27) 
Lag-2 investment — 0.03 0.26 
(1.29) (0.27) 
(1.32) (0.28) 
Lag-1 sales — 1.08 —0.42 
(4.84) (1.03) 
Lag-2 sales 5.02 1.69 
(4.60) (0.98) 
Lag-3 sales — 4.61 — 1.26 
(4.58) (0.97) 


F tests (p values in parentheses): 


All employment 0.04 0.36 
coefficients (0.991) (0.785) 

All investment 0.68 0.98 
coefficients (0.562) (0.402) 

All sales coefficients — 0.71 


(0.547) (0.251) 
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TaBLeE A2— Continued 


Dependent variables 


Elections lost by the union Elections won by the union 
Explanatory variables (i) (ii) (iii) (i) (ii) (iii) 


Firms in Nonmanufacturing: 


Lag-1 union loss — 6.94 — 6.54 — 6.73 — 0.04 0.01 
(4.57) (4.54) (4.55) (0.65) (0.64) 
Lag-2 union loss —5.29 —5.99 — 5.55 0.11 0.04 
(4.53) (4.53) (4.55) (0.64) (0.64) 
Lag-3 union loss — 4,29 — 4.08 — 4.23 — 0.42 —0.41 
(4.43) (4.43) (4.46) (0.63) (0.63) 
Lag-1 union win — 1.32 — 1.16 — 1.35 — 0.92 — 0.96 
(10.30) (10.31) (10.30) (1.46) (1.46) 
Lag-2 union win 3.75 4.22 — 1.21 — 1.26 
(10.27) (10.31) (1.45) (1.46) 
Lag-3 union win 0.40 0.53 — 0.04 —0.13 
(2.53) (2.64) 55 (0.36 (0.37) 
Lag-1 employment — 1.28 LZ 
(9.02) (1.28) 
Lag-2 employment — 13.84 - - 1.21 
(8.40) (1.19) 
Lag-3 employment —0.89 _ 0.46 
(8.31) (1.18) 
Lag-1 investment - 0.40 
(0.31) 
Lag-2 investment - 0.13 
(0.32) 
Lag-3 investment 0.04 
(0.31) 
Lag-1 sales — 0.65 = 
(7.95) 
Lag-2 sales — 8.55 
(7.82) 
Lag-3 sales 4.84 
(8.01) 


F tests (p values in parentheses): 


All employment 0.92 - 1.01 
coefficients (0.432) (0.388) 
All investment 0.29 — — 0.56 — 
coefficients (0.836) (0.643) 
All sales coefficients — 0.50 — 0.02 
(0.686) (0.995) 


Notes: Each regression also included a complete set of year and firm dummy variables. 
The coefficients on the election and growth variables have been multiplied by 100 and 
1,000, respectively. For firms in manufacturing, N = 1,968; for firms in nonmanufactur- 
ing, N = 544. 
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0.06 
(0.64) 
0.07 
(0.64) 
—0.34 
(0.63) 
—0.94 
(1.46) 
—1.19 
(1.46) 
-0.07 
(0.36) 
0.01 
(1.13) 
— 0.16 
(1.11) 
—0.23 
(1.14) 
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Seniority and Monopsony in the Academic Labor Market 


By MIcHAEL R RANsoM* 


Workers with high seniority usually have higher wages than workers with low 
seniority and the same total labor-market experience. In contrast, the results of 
this paper indicate that for university professors higher seniority is associated 
with lower salaries. | document this finding for national surveys and individual 
institutions and explain it as due to monopsonistic discrimination by employers. 


(JEL J31, J44) 


An idea that is central to many theories 
about the life-cycle structure of pay is that 
workers with high seniority are paid more 
than otherwise similar workers with low se- 
niority. As an example of the types of re- 
turns to seniority observed in cross-sectional 
studies, Table 1 reports results of a regres- 
sion analysis of various occupational and 
educational groups from a recent sample of 
adult workers in the United States, the May 
1988 Current Population Survey (CPS).' For 
all workers, earnings initially increase by 
about 3 percent with each additional year of 
seniority. For a worker with 10 years of 
seniority (sample average seniority is 9.8 
years), an additional year of seniority is 
associated with a 2-percent increase in pay, 
and pay is more than 30-percent higher 
than for a worker of the same age but with 
no seniority. Wages show a strong direct 
association with seniority. 

For most of the other groups listed in 
Table 1, the seniority profile is quite similar. 
Even for highly educated workers and for 
workers who have very “general” skills, such 
as accountants and school teachers, a sub- 


“Department of Economics, Brigham Young Uni- 
versity, Provo, UT 84602. 1 am grateful for helpful 
discussions with David Card, Val Lambson, James B 
McDonald, John Pencavel, and participants of the In- 
dustrial Relations seminar at Princeton University. 
Some of the data used in this paper were provided by 
the Interuniversity Consortium for Political and Social 
Research. I am solely responsible for opinions and 
analyses contained in the paper. 

Technical documentation for this survey can be 
found in U.S. Department of Commerce (1989). 


stantial part of salary growth is associated 
with (if not attributable to) increases in 
seniority. For school teachers, seniority ap- 
pears to be particularly important in ex- 
plaining the life cycle of pay, while experi- 
ence seems relatively unimportant. 

The labor market for university profes- 
sors provides a curious contrast. In the small 
sample of college and university teachers 
presented in Table 1, there is little evidence 
that seniority matters at all. However, care- 
ful analysis of large samples of university 
professors shows that salaries and seniority 
are negatively correlated, after controlling 
for experience and other variables. Anecdo- 
tal evidence suggests that this can be ex- 
treme at times. In a recent survey by the 
American Council on Education (see Debra 
E. Blum, 1990), more than half of the doc- 
toral-granting institutions surveyed re- 
sponded that in some departments they had 
hired “new, junior faculty members at a 
salary above that of some senior faculty 
members in the same department.” In an- 
other recent example, two professors filed 
suit under the age discrimination act, alleg- 
ing that the practice of paying “market 
rates” to new hires but not to others had a 
disparate impact on older workers. The two 
plaintiffs were the most senior members of 
the school’s business college and were the 


lowest paid (see Bureau of National Affairs, 
1991).? 


The employees lost the case. 
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TABLE 1—SELECTED REGRESSION COEFFICIENTS, MAY 1988 CURRENT POPULATION SURVEY 


THE AMERICAN ECONOMIC REVIEW 


(DEPENDENT VARIABLE = LOGARITHM OF WEEKLY EARNINGS) 


MARCH 1993 


Marginal effect 


of seniority 
Group Age (Age)* Seniority (Seniority)? at 10 years Sample size 
All workers 0.0395 0.0315  -5.23x10~* 0.0202 16,404 
(0.0027)  (3.10x10~°) (0.0013) (4.20x 10>) 
Highly educated 0.0678  -—7.16x10~* 0.0224 -3.56x10~4 0.0153 1,457 
(18 or more years (0.0119) (1.36 10~*) (0.0056) (2.08 10~*) 
of schooling) 
School teachers — 0.0041 2.36 X 1075 0.0366 —6.14x 0.0243 572 
[156-157] (0.0146) (0.0068) (2.38 x 10~*) 
Accountants [23] 0.0936 — 0.0011 0.0251 3.69x10~4 0.0176 204 
(0.0240) (0.0003) (0.0117) (4.08x 10~*) 
College and university 0.1381 — 0.0013 — 0.0030 4.78 x 1074 0.0066 115 
teachers [113-154] (0.0431) (0.0005) (0.0211) (7.72 x 10~*) 


Notes: Standard errors are in parentheses. Numbers in brackets refer to the CPS occupational codes for the group. 
The sample consists of workers between the ages of 25 and 70 who responded to job-seniority questions. Weekly 
earnings data are right-censored (at $999) so a “Tobit” reg: sion is used. Weekly earnings for about half of the 
sample come from the June 1988 Current Population Survey. Other variables included in the analysis are education 
level, race, sex, whether Hispanic, whether resident of a metropolitan area, and region of the country. 


I. The Seniority Profile of Compensation 


There are many theories of the labor 
market that predict a positive association 
between pay and seniority. Perhaps the most 
prominent explanation is the theory of spe- 
cific human capital, explained in Walter Oi 
(1962), Jacob Mincer (1974), and many oth- 
ers. According to this theory, the firm re- 
wards the worker for acquiring skills and 
knowledge that are specific to the current 
employer. Higher wages deter the worker 
from leaving with the firm’s portion of the 
human-capital investment. 

More recently, Gary S. Becker and 
George Stigler (1974) and Edward P. Lazear 
(1981) have developed models in which the 
firm pays higher wages to more senior work- 
ers as an incentive mechanism. The promise 
of high pay motivates the worker to provide 
an appropriate level of effort, since workers 
who are caught shirking or cheating are 
dismissed from the firm and lose the reward 
of higher wages later in their careers. Un- 
like the human-capital model, this model 


does not predict higher productivity for 
more senior workers. 

In the model of Joanne Salop and Steven 
Salop (1976), firms offer steeply sloped se- 
niority profiles as a selection mechanism. 
The present value of the earnings stream is 
higher than alternatives, but the starting 
pay is lower than alternatives. The high 
wages in late career compensate for the low 
starting wages only for employees who in- 
tend to stay with the firm for a long period 
of time. 

Numerous empirical studies of wages, as 
in the analysis reported in Table 1, have 
found a positively sloped seniority profile. 
However, a positive correlation between pay 
and seniority might be observed in a cross 
section even if pay does not increase with 
seniority for individuals. For example, Boyan 
Jovanovic (1979) suggests a matching model 
in which firms and workers learn about each 
other over time. Well-matched workers are 
highly paid and stay with the firm. Poorly 
matched workers receive low wages and are 
likely to leave the firm searching for better 
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matches. As a result, average wages in- 
crease with seniority even if the wages of 
any individual do not. Efficiency wage mod- 
els predict a similar pattern.* Because indi- 
viduals are paid above alternatives, they are 
less likely to quit, so workers in high-wage 
firms will have higher seniority on average 
than workers in low-wage firms. Indeed, for 
some versions of the efficiency wage model, 
lower costs associated with lower turnover 
are crucial. 

Some recent empirical work has tried to 
determine whether wages increase with se- 
niority by studying longitudinal changes in 
the pay of individuals. In similar papers, 
Katherine G. Abraham and Henry S. Far- 
ber (1987) and Joseph G. Altonji and Robert 
A. Shakotko (1987) concluded that much of 
the cross-sectional correlation between 
wages and seniority is due to heterogeneity, 
a conclusion that is consistent with effi- 
ciency wage models or other types of match- 
ing models. However, Robert Topel (1991) 
has criticized these two papers. Using the 
same longitudinal data set but slightly dif- 
ferent estimation methods, Topel finds that 
wages of individuals increase rather sharply 
with increases in seniority. Topel reports 
that for a typical male worker in the United 
States, 10 years of seniority increase the 
wage by more than 25 percent relative to 
alternatives. This is only slightly smaller than 
the cross-sectional estimate reported in 
Table 1 for all workers. 


II. The Seniority Profile for University 
Professors 


Findings from the labor market for uni- 
versity professors are inconsistent with all of 
these theories and estimates. Here I present 
evidence from several sources that demon- 
strates a negative return to seniority for 
university professors. Data are from two 
different types of sources: first, from nation- 
wide surveys of university faculty; and sec- 
ond, from employment records of individual 
institutions. 


Joseph Stiglitz (1987) describes several versions of 
the efficiency wage model. 
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TABLE 2—SELECTED REGRESSION COEFFICIENTS 
FRoM 1969 CARNEGIE SURVEY DATA 
(DEPENDENT VARIABLE = LOGARITHM 
oF ANNUAL SALARY) 


All universities 
and colleges 


“High-quality” 


Variable universities only 


Experience: 
Q-1 years 
0.170** 
(0.015) 
0.280** 
(0.015) 
0.387** 
(0.016) 
0.528** 
(0.016) 
0.662** 
(0.017) 
0.760** 
(0.018) 
0.851** 
(0.027) 


0.099** 
(0.006) 
0.186** 
(0.006) 
(0.007) 
0.383** 
(0.007) 
0.477** 
(0.007) 
(0.008) 
0.591** 
(0.010) 


2-3 years 
4—6 years 
7-9 years 
10-14 years 
15-19 years 


20-29 years 


30 or more years 


Seniority 
Q-1 years 
2-3 years — 0.026** —0.030** 
(0.005) (0.012) 
— 0.030** —0.040** 
(0.005) (0.012) 
—0.026** — 0.026 
(0.006) (0.014) 
—0.049** —0.056** 
(0.006) (0.015) 
— 0.073** —0.110** 
(0.007) (0.017) 
—0.067** —0.128** 
(0.008) (0.018) 
—0.042** —0.154** 
(0.011) (0.026) 


4—6 years 
7-9 years 
10-14 years 
15-19 years 


20-29 years 


30 or more years 


50,214 
0.544 


11,665 
0.504 


Sample size: 


Re: 


Notes: Standard errors are given in parentheses. Other 
variables: degree type, field of teaching appointment, 
sex, race, quality of degree-granting institution, private 
or public employer, and length of contract. 

““ Coefficient is significantly different from 0 at the 
l-percent level. 


A. Estimates from National Surveys 


Tables 2, 3, and 4 report estimates from a 
salary regression model on data from three 
national surveys of teaching faculty at uni- 
versities and colleges. I report results for a 
sample of all college and university faculty 
and for a smaller sample of faculty at re- 
search universities. Table 2 refers to data 
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TABLE 3—SELECTED REGRESSION COEFFICIENTS, 
1972-1973 ACE Survey Data 
(DEPENDENT VARIABLE = LOGARITHM 
OF ANNUAL SALARY) 


MARCH 1993 


TABLE 4—SELECTED REGRESSION COEFFICIENTS, 
1977 SURVEY OF THE AMERICAN PROFESSORATE 
Data (DEPENDENT VARIABLE = LOGARITHM 
OF ANNUAL SALARY) 


All universities Research 


All universities Research 


Variable and colleges universities only Variable and colleges universities only 
Experience 0.039** 0.045** Experience: 
(0.001) (0.001) 0-2 years = = 
3-5 years 0.084** 0.099** 
(Experience)? —6.3x10~** -—7.0x 107 ** (0.021) (0.029) 
(1.8x 10~>) (21%10->) 6-10 years 0.196** 0.249** 
(0.021) (0.029) 
Seniority —3.5x10~4 —0.0037** 11-15 years 0.351"* 0.405** 
(73 (0.0009) (0.023) (0.031) 
16-20 years 0.476** 0.549** 
(Seniority)? 92% 5.4x10-™* (0.024) (0.033) 
(2.2x 107°) (2.6 10~>) More than 20 years 0.545** 0.610** 
(0.025) (0.034) 
Marginal effect of an additional year: 
At 5 years —3.0x10~4 — 0.0034** Seniority: 
of seniority (6.4x 1074) (0.0008) 0-2 years — 
At 10 years —2.6x10~4 —0.0031** 3-5 years 0.014 0.006 
of seniority (5.41074) (0.0007) (0.016) (0.022) 
6-10 years 0.026 — 0.014 
Sample size: 30,313 16,207 (0.017) (0.023) 
0.464 0.540 11-15 years — 0.001 — 0.042 
(0.019) (0.026) 
Notes: Standard errors are given in parentheses. Other 16-20 vears ~0.013 —0.069* 
variables: degree held, race, sex, field, region, city size, ; (0.022) (0.029) 
SMSA type, and contract length. More than 20 years 0.010 ~0.040 
“Coefficient is significantly different from 0 at the (0.023) (0.030) 
5-percent significance level. 
** Coefficient is significantly different from 0 at the Sample size: 3,477 2,232 
1-percent significance level. R?: 0.574 0.567 


from the Carnegie Commission National 
Survey of Higher Education: Faculty Study, 
1969.4 This survey was completed in the 
spring of 1969. There were a total of 60,028 
respondents, of which 50,214 provided com- 
plete responses to questions about salary, 
experience, seniority, and other factors con- 
sidered in the regression model. The survey 
reports experience and seniority in intervals 
only.° 

Experience shows a strong positive associ- 
ation with salary. For all colleges and uni- 
versities, salaries increase by about 4 or 5 


“This data set is available from the Interuniversity 
Consortium for Political and Social Research and is 
documented in Carnegie Commission (1978). 

Experience is here defined as the number of years 
employed in colleges and universities beyond the level 
of a teaching or research assistant. Seniority is defined 
as the number of years employed at current institution 
beyond the level of a teaching or research assistant. 


Notes: Standard errors are given in parentheses. Other 
variables: degree type, field of teaching appointment, 
race, sex, region, quality of degree-granting institution, 
and contract length (8—9 months or 11-12 months). 
* Coefficient is significantly different from 0 at the 
5-percent significance level. 
** Coefficient is significantly different from 0 at the 
1-percent significance level. 


percent per year initially, with diminishing 
marginal returns. Experience profiles are 
much steeper at “high-quality universities.””° 


°Since the 1969 study by the Carnegie Commission, 
universities and colleges in the United States have 
been classified into the following categories: Research 
Universities I and II, Doctoral-Granting Universities I 
and II, Comprehensive Colleges and Universities I and 
II, Liberal-Arts Colleges I and II, and Two-Year Col- 
leges. These categories are determined by the number 
and type of degrees granted and the amount of re- 
search funding received by the universities. For the 
1969 survey, I have assumed that the “high-quality” 
university category corresponds roughly to the more 
recent designation of Research University. 
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On the other hand, the marginal effect of 
seniority is clearly negative. For the overall 
sample, individuals with 10-14 years of se- 
niority earn about 7 percent less than indi- 
viduals with less than 2 years of seniority, 
after controlling for type of degree, field, 
sex, race, quality of institution that granted 
the individual his or her highest degree, the 
length of contract (8—9 months versus 11-12 
months), and whether the employing institu- 
tion is privately controlled. At high-quality 
universities the effect is even more pro- 
nounced. Salaries decline by about 0.5 per- 
cent per year of seniority. Individuals with 
at least 30 years of seniority earned approxi- 
mately 15-percent less than individuals with 
less than 2 years of seniority. 

The results reported in Table 3 are based 
on the American Council on Education’s 
(ACE) 1973 survey, which was conducted 
between December 1972 and June 1973.’ 
This survey was designed to parallel closely 
the 1969 Carnegie Commission survey. 


There were 53,029 respondents, of which 
30.313 provided complete information on 
the variables used in the regression models. 


In this survey, experience is defined as years 
since the highest degree was granted. Se- 
niority is measured as the number of years 
since an individual was hired by the current 
institution. 

Table 3 reports regression estimates of 
the effect of seniority and experience for 
the 1973 data. Since the variables are mea- 
sured in continuous form, I have estimated 
a model with linear and quadratic measures 
of both variables. Salaries increase rapidly 
with experience, but with diminishing 
marginal returns. The rate of growth with 
experience is larger for individuals at re- 
search universities. The estimated effect of 
seniority is negative, again. However, for 
the overall sample the hypothesis that se- 
niority has no effect cannot be rejected at 
usual significance levels.* For the sample 


’See Alan E. Bayer (1973) for a detailed discussion 
of the survey. This documentation and the data set are 
available through the Interuniversity Consortium for 
Political and Social Research. 

‘If seniority is redefined in the same intervals used 
in Table 2, the lowest-seniority group is the highest-paid 
group. Salaries for each of the other seniority intervals 
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from research universities, salaries decline 
by about 0.4 percent per year with each 
additional year of seniority. The decline is 
roughly linear, as in the 1969 sample. An 
additional year of seniority reduces the 
salary of someone with 10 years of seniority 
by about 0.3 percent. 

Table 4 refers to data from the “1977 
Survey of the American Professoriate,” ad- 
ministered by the Institute for Social In- 
quiry at the University of Connecticut. This 
survey had only 4,248 respondents, of which 
3,477 provided complete responses for the 
variables used in this analysis. Data on ex- 
perience and seniority are available only in 
interval form.’ 

The 1977 data display patterns similar to 
the other data sets. Salaries grow with expe- 
rience, and this growth is more rapid at 
research universities. The data for all uni- 
versities show essentially no correlation be- 
tween seniority and salary after controlling 
for experience and other variables. For re- 
search universities, salaries do fall with in- 
creased seniority. However, the effect is 
smaller than that observed in the 1969 and 
1973 data. The effect is largest for those 
with 16-20 years seniority, who had salaries 
that were about 7-percent lower than those 
with 2 years or less seniority, after control- 
ling for experience, education, and other 
factors. 

Figure | graphically summarizes the ef- 
fect of seniority on salary at research uni- 
versities from the analysis presented in the 
last columns of Tables 2, 3, and 4. The 1973 
and 1977 surveys suggest that the most a 
professor could gain by changing employers 
is about 5 or 6 percent. The 1969 survey 
shows a much larger potential seniority 
“penalty.” 

A possible weakness of the preceding 
analysis is that academic labor markets are 
field-specific. The demand and supply of 


are lower than for the lowest-seniority interval, and the 
hypothesis that there is no difference between the 
lowest-seniority group and the higher-seniority groups 
can be rejected with usual tests. However, the seniority 
“penalty” is about the same for all seniority levels. 

For this data, experience is defined as years em- 
ployed on a full-time basis in higher education. Senior- 
ity is defined as years employed on a full-time basis at 
the present institution. 
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FiGureE 1. EstTimMATED SENIORITY—SALARY PROFILE FOR FACULTY OF 
RESEARCH UNIVERSITIES: NATIONAL SURVEY DATA, VARIOUS YEARS 


accounting professors may be quite differ- 
eni from that of history professors. To ana- 
lyze salary determination in different aca- 
demic fields, Table 5 presents estimates for 
selected narrowly defined fields based on 
the 1973 American Council of Education 
survey, restricting the sample to faculty at 
research universities.!° At 10 years of se- 
niority, the effect of an additional year of 
seniority is negative for all fields except 
elementary and secondary education and 
chemistry. Most estimates in Table 5 indi- 
cate a decline of more than 0.5 percent per 
year, and some, such as accounting, and 
economics show a decline of more than 1 
percent per year.!! (Average seniority ranges 
from approximately 9 years to 12 years for 
these groups.) 


These models are estimated for 15 of the 75 fields 


that could be designated in the survey. Fields were 
selected to represent a broad range of specialities, but 
also to obtain a large sample size. Field refers to the 
department of the teaching appointment of the respon- 
dent. 

"Since the relationship estimated is quadratic, the 
magnitude of the seniority effect usually falls with 
higher seniority. However, in most cases the quadratic 
term is quite small, so the relationship is almost linear. 


B. Estimates from Individual Institutions 


Estimates from individual universities also 
show negatively sloped seniority profiles. For 
example, Emily Hoffman (1976) estimated a 
salary regression model with a linear senior- 
ity term for data from two large universities. 
She found a decline of 0.2 percent per year 
at the University of Massachusetts using 
1974 data, and a decline of 0.5 percent per 
year at another “large urban university,” 
using data from 1970. Her analysis con- 
trolled for age (not experience), sex, race, 
field, and degree type. (The University of 
Massachusetts is categorized as a “research 
university” under the Carnegie system. Pre- 
sumably the other “large urban university” 
is comparable.) 

I have estimated similar models for daia 
from another large state university, the Uni- 
versity of Arizona.'? Table 6 presents esti- 
mates for salaries from the 1972, 1977, and 


'2See Sharon B. Megdal and Ransom (1985) for a 
description of the data on which Table 6 is based. 
Copies of the data set are available from the author 
upon request. The University of Arizona is categorized 
a “Research University” under the Carnegie system. 
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TABLE 5—SELECTED REGRESSION COEFFICIENTS FROM 1973 ACE Survey Data: EsTIMATES BY DEPARTMENT 
OF TEACHING APPOINTMENT (DEPENDENT VARIABLE = LOGARITHM OF ANNUAL SALARY) 


Marginal effect 
of seniority 


Field Experience (Experience)* Seniority (Seniority)? —_ at 10 years R? Sample size 


0.0620** 
(0.0080) 


0.0542** 
(0.0038) 


0.0684** 
(0.0035) 


0.0650** 
(0.0058) 


0.0602** 
(0.0046) 


0.0560** 
(0.0041) 


0.0510** 
(0.0052) 


0.0604** 
(0.0084) 

0.0427** 
(0.0100) 


0.0267** 
(0.0054) 


0.0621** 
(0.0097) 


0.0260** 
(0.0064) 


0.0534** 
(0.0073) 


0.0180** 
(0.0067) 


0.9519** 
(0.0046) 


—0.0011** 
(0.0002) 


— 0.0007** 
(0.0001) 


—0.0011** 
(0.0001) 


—0.0010** 
(0.0001) 


— 0.0009** 
(0.0001) 


— 0.0009** 
(0.0001) 


— 0.0008** 
(0.0002) 


—0.0013** 
(0.0002) 


—0.0007** 
(0.0002) 


— 0.0003** 
(0.0001) 


—0.0011** 
(0.0001) 


— 0.0003 
(0.0002) 


—0.0009** 
(0.0002) 


— 0.0002 
(0.0062) 


— 0.0008** 
(0.0001) 


Microbiology — 0.0138* 


(0.0070) 


— 0.0070 
(0.0046) 


— 0.0087* 
(0.0036) 


0.0052 
(0.0057) 


— 0.0078 
(0.0056) 


— 9.0094 
(0.0050) 


0.0004 
(0.0059) 
— 0.0263* 
(0.0086) 


— 0.0086 
(0.0105) 


0.0036 
(0.0061) 


— 0.0121* 
(0.0058) 


— 0.0085 
(0.0063) 


— 0.0023 
(0.0072) 

— 0.0123* 
(0.0059) 


—0.0114* 
(0.0053) 


0.0003 
(0.0002) 
—2.4x 1075 
(1.4x 1074) 
0.0001 
(0.0001) 


— 0.0003 
(0.0002) 


4.0« 107° 
(0.0002) 


3.1x107> 
(1.0x 1074) 


0.0002 
(0.0002) 


—0.0122* 0.570 237 
(0.0062) 


— 0.0073* 0.668 
(0.0034) 


—0.0077* 0.692 
(0.0027) 


0.0024 0.638 
(0.0042) 


— 0.0079 0.722 
(0.0041) 


—0.0097** 0.635 
(0.0037) 


— 0.0019 0.650 
(0.0042) 


—0.0189** 0.725 
(0.0065) 


— 0.0067 0.440 
(0.0081) 


0.0023 0.667 
(0.0045) 


—0.0090* 0.753 
(0.0043) 


— 0.0054 0.434 
(0.0049) 


— 0.0037 0.261 
(0.0051) 


— 0.0060 0.691 
(0.0044) 


— 0.0093* 0.521 
(0.0039) 


History 

Mathematics and statistics 
Chemistry 

General psychology 
Economics 

Sociology 


Accounting 0.0007* 


(0.0002) 


Management — 0.0002 


(0.0003) 


Elementary and 
secondary education 


0.0001 
(0.0002) 


Chemical engineering 0.0003 


(0.0002) 


0.0003 
(0.0002) 


Music 


Medicine or surgery — 0.0001 


(0.0002) 


Nursing 0.0006** 


(0.0002) 


0.0002 
(0.0002) 


English language 
and literature 


Note: Standard errors are given in parentheses. 
* Coefficient is significantly different from 0 at the 5-percent significance level. 
** Coefficient is significantly different from 0 at the 1-percent significance level. 


1982 academic years. The estimated coef- 
ficients indicate that, initially, an additional 
year of seniority reduces salary by 0.5-1.5 
percent. The results from 1982 suggest that 
salaries decline at a rate of about | percent 
per year, a much larger effect than observed 
in the national samples. The estimated se- 
niority profile “bottoms out” at about 15 
years for the 1972 sample and at about 25 
years for the 1977 sample. Figure 2 graphi- 
cally shows the corresponding estimated se- 
niority—salary profiles. 


In summary, data from large research- 
oriented universities in the United Sates 
show that individuals with high seniority are 
paid less, on average, than individuals with 
low seniority, after controlling for total ex- 
perience, sex, field, and other variables.'* 


For other types of institutions, the effect of senior- 
ity on salaries is not so clear. For example, analysis of 
separate samples estimates positive returns to seniority 
for junior-college faculty in the 1969 and 1973 data, 
negative returns for liberal-arts colleges in the 1977 
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FiGurE 2. EsTIMATED SENIORITY—SALARY PROFILES FOR FACULTY OF THE 


UNIVERSITY OF ARIZONA, VARIOUS YEARS 


TABLE 6—SELECTED REGRESSION COEFFICIENTS FROM ANALYSIS OF UNIVERSITY OF ARIZONA SALARY DATA 


(DEPENDENT VARIABLE = LOGARITHM OF ANNUAL SALARY) 


Variable 1972 1977 1982 
Years experience 0.0317** 0.0440** 0.0439** 
(0.0018) (0.0017) (0.0021) 
(Experience)? —0.0005** —0.0006** —0.0006** 
(4.0 107°) 
Years seniority — 0.0058** —0.0144** —0.0111** 
(0.0020) (0.0019) (0.0022) 
(Seniority)? 0.0002* _ 0.0003** 0.0001 
(7.4 107°) (5.6 10~>) 
Other variables whether Ph.D., sex, whether Ph.D., sex, whether Ph.D., sex, 
52 department indicators 44 department indicators 87 department indicators 
Sample size: 1,085 L127 1,197 
0.5895 0.6737 0.6407 


Note: Standard exrors are given in parentheses. 
“Coefficient is significantly different from 0 at the 5-percent significance level. 
** Coefficient is significantly different from 0 at the 1-percent significance level. 


While estimates vary from sample to sample salaries decline by about 0.5 percent per 
and from year to year, it appears that year of seniority. For a typical professor, 
with about 10 years of seniority, changing 


employers would result in an increase in pay 


data, and positive returns for comprehensive colleges of from around 5 percent (using estimates 
and universities in the 1977 data. from the national samples) to about 10 per- 
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cent (using estimates from the University of 
Arizona sample). This analysis only consid- 
ers salaries. Nonmoney compensation at 
universities is also probably much less tied 
to seniority than at the typical firm. For 
example, one of the biggest components of 
in-kind pay is paid leave. In the United 
States, paid leave is traditionally tied to 
seniority. In most academic institutions, 
leave time is similar for professors of all 
seniority levels. 

Durable employment relationships at uni- 
versities are associated with low salaries! 
This result is astonishing in the face of the 
current economic theories of the seniority— 
pay relation. The human-capital model, for 
example, suggests no reason why pay would 
fall with higher seniority, even in cases 
where there is no “specific” human capital. 
Likewise, the well-known matching and 
deferred-compensation models predict that 
durable employment relationships will be 
high-paying. 


III. Worker Quality and Seniority 


One possible explanation for the negative 
effect of seniority on salaries in cross sec- 
tions is that individuals with long seniority 
are simply less productive. For example, 
one argument is that the best professors can 
find employment anywhere, but the less able 
have to stay where they are. This is the 
basic flavor of two models that have ap- 
peared in the literature. Milton Harris and 
Bengt Holmstrom (1982) suggest a model in 
which a worker of unknown ability is ini- 
tially offered a guaranteed lifetime wage. 
Over time, the worker’s productivity is re- 
vealed to all firms. If a worker turns out to 
be very productive, the wage will be in- 
creased to match outside offers. However, if 
an individual turns out to have low produc- 
tivity, no outside offer will dominate the 
initial guaranteed wage. Those receiving 
outside offers are more likely to change 
firms, so high-seniority individuals will tend 
to have low productivity and low wages. 

Lazear (1986) has proposed a similar 
“lemon-flavored” model. As in the Harris- 
Holmstrom model, firms are initially imper- 
fectly informed about worker quality. As a 
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result, some workers are underpaid. Out- 
side firms identify underpaid workers and 
attempt to hire them. These “raider” firms 
succeed if they provide a better match for 
the employee than the incumbent firm. Oth- 
erwise, the incumbent firm matches outside 
offers, and the employee stays. “Raiding” 
by competitors results in higher wages and 
higher turnover for high-quality workers, so 
high-seniority employees tend to be of lower 
quality and to have lower wages. Low-qual- 
ity workers remain with the incumbent firm 
because of the firm’s persistent ignorance. 
In Lazear’s model, the negative relationship 
between seniority and pay depends on em- 
ployees being valued differently at different 
firms, since employees only move when they 
find the raiding firm to be a better “match.” 

These models justify lower pay for higher 
seniority because of lower productivity. Al- 
though “quality” is a difficult concept to 
measure, an important product of scholarly 
activity at research universities is publica- 
tion. One way to control for productivity is 
to include variables that measure publica- 
tion activity in the salary regression models. 
Table 6 duplicates the analysis found in 
Tables 2, 3, and 4 but also includes vari- 
ables to measure the professional publica- 
tions of the individual.'* It is clear from 
Table 6 that a strong negative correlation 
between salaries and seniority persists, even 
after controlling for each individual’s publi- 
cation activity. In fact, the estimated penalty 
for another year of seniority is even greater 
for the 1973 data when publication variables 
are included in the regression. For the 1969 
analysis, the penalty is greater for seniority 
intervals up to 10 years. Professors with 
high seniority do not differ much from indi- 
viduals with low seniority as far as publica- 
tion records are concerned. This result sug- 
gests that selection by quality is not an 
adequate explanation for the negative asso- 
ciation between salary and seniority. 


'*Publications are measured by a set of interval 
dummies for (i) number of books, monographs, or 
manuals published and (ii) number of articles pub- 
lished in academic or professional journals. 
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IV. A Model of Monopsonistic Discrimination 


I propose an alternative “heterogeneity” 
model to explain the negative slope of the 
seniority profile. In this model, moving costs 
drive a wedge between the “market” wage 
and the wage the employer must pay. Em- 
ployees with high moving costs will tend to 
have high seniority and low pay if the em- 
ployer is able to discriminate. 

Consider a simple model of a university 
that is able to offer a different salary to 
each current faculty member. Individual 
faculty members must incur costs to move 
to another job.'° The university must meet 
the market wage to replace a faculty mem- 
ber who rejects the offered salary and 
chooses to leave. With some uncertainty 
about an employee’s willingness to move, 
the expected cost to the employer of a spe- 
cific wage offer would be 


(1) EC(w,) = 
[1- p(w,,m,w,)|w, 


where EC is the expected employment cost 
(to fill a teaching position), w, is the wage 
offered to the individual, w, is a competi- 
tively determined wage rate (what the indi- 
vidual could get if he or she moved), and m 
is the cost to an individual of moving to a 
different institution. The probability that an 
individual will accept the offered wage is 
p(-), a function of the wage offer, the alter- 
native wage, and the individual’s cost of 
moving. This model assumes that both em- 
ployer and employee know the employee’s 
alternative salary. The model accomme- 
dates no offers or counteroffers.'° 

The only source of uncertainty is that the 
employer cannot know whether an em- 
ployee will move at a given offered wage, 
although the probability of a move is known. 


'SMost universities are isolated from one another 
geographically, so it is fair to assume that high moving 
costs characterize the market for university professors. 

In other words, this wage offer is the /ast wage 
offer that the firm wili make. The employee either 
takes it or leaves. 
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A plausible structure for p is 
(2) p(w,,m,w,) = p(w, + 6m — w,) 


where 6 is a discount factor. In this specifi- 
cation, individuals weigh the increased an- 
nual income, w,—w,, against the annual- 
ized cost of moving, 5m. Those with higher 
moving cost will be more likely to accept 
any given wage offer. I assume that the 
function p is continuous, nonnegative, dif- 
ferentiable, and monotonically increasing. 
The wage offer that minimizes the ex- 
pected employment cost, w* is implicitly 
defined by the first-order condition: 


(3) (we —w,)p' + p=0 

where p’ is the derivative of the function p. 
Since both p and p’ are positive, w* must 
be less than w,. The sufficient condition for 
cost minimization is satisfied if 


(4) 2p’ —w,)p” > 0. 


Taking the total differential of (3), using (2), 
yields the comparative-statics result: 


p’ + (we w,)p" 
2p’ +(wx —w,)p” 


<0. 


Thus universities offer lower wages to those 
who find it most costly to move. Since the 
expression in brackets in equation (5) must 
be between 0 and 1, for each dollar increase 
in the discounted cost of moving the offered 
wage is reduced by less than one dollar. 
Therefore, individuals with higher moving 
costs are more likely to accept the offered 
salary. Explicitly, the total derivative of 
equation (2) with respect to m is 


d 
(6) =5(p +P’) 


+ (ws w,)p" 
2p'+(we — 


The expected seniority of an individual is 
closely related to the value of p. Each year 


dw* 
2 
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an individual is offered a salary. If the salary 
is acceptable, the individual stays; other- 
wise, he or she leaves. We observe someone 
with T years of seniority at an institution 
only if he or she has accepted the wage 
offer for T consecutive years. Consider all 
professors who have the same values for w, 
and m, (and therefore w* and p). The 
fraction who move in any year will be 1— p, 
so the proportion of this group with no 
seniority will be 1 — p. Likewise, the propor- 
tion with 1 year of seniority will be (1— p)p: 
the proportion with 2 years will be (1 — p)p?; 
and so on. Thus the proportion with T years 
of seniority will be p’(1— p). Ignoring the 
fact that professors have finite careers, ex- 
pected seniority is 


(7) E(S)=(1-p) 
7=0 p 


Expected seniority increases with p, and p 
increases with m, so professors with high 
moving costs will have higher average se- 
niority. 

Because high values of m not only lead to 
high values of seniority, but also to low 
salaries, seniority and wages will be in- 
versely related. In other words, comparing 
individuals who would obtain the same 
salary in the market, those with higher cur- 
rent seniority will have lower salaries, on 
average. However, this model does not pre- 
dict that salaries will fall with seniority for 
individuals; the negative seniority effect is 
simply an artifact of heterogeneity. The esti- 
mated regression coefficient for seniority is 
biased downward because the unobserved 
moving-cost variable, m, is not included. 
Thus, this model has much in common with 
the matching model of Jovanovic (1979), 
although the implications for the relation- 
ship between pay and seniority are exactly 
opposite. Similarly, the recent the papers by 
Abraham and Farber (1987), Altonji and 
Shakotko (1987), and Topel (1991) are pri- 
marily concerned with sources of hetero- 
geneity that bias the effect of seniority 
upward. Apparently, the monopsony het- 
erogeneity dominates other sources of het- 
erogeneity for the academic labor market. 
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Recent papers, such as Topel (1991), have 
carefully controlled for heterogeneity using 
panel data. Appropriate data for this type 
of analysis of the academic labor market are 
not available. However, since this model 
identifies moving costs as the source of the 
sample heterogeneity, if it were possible to 
include a true measure of moving costs in 
the salary regression model, then the esti- 
mated coefficient for seniority would not be 
biased downward. Unfortunately, moving 
costs are difficult to quantify and measure. 
Not only do they include the costs of physi- 
cally relocating the household, but they also 
include psychic costs, such as the pain of 
separating from friends and family. Further- 
more, the available data are limited: it is 
not even possible to identify city of resi- 
dence from the national data sets analyzed 
in this paper. 

Marital status, spouse work status, and 
number of children certainly influence the 
mobility of families and have often been 
identified as such in studies of geographical 
mobility.'’ If the presence of a spouse in- 
creases mobility costs, then one would ex- 
pect, on the basis of the monopsony model 
presented here, that married men would 
have higher seniority and lower salaries than 
single men. However, a simple regression 
analysis of the 1973 survey'® shows that 
although married men do have more senior- 
ity than single men, controlling for experi- 
ence, education, publications, and other 
variables, a regression analysis of salaries 
shows that married men have significantly 
higher salaries than single men. This mar- 
riage premium is a_ well-known phe- 
nomenon that is usually explained in terms 
of heterogeneity. In other words, although 
married men probably have higher moving 
costs, they also differ from single men in 


For example, Jacob Mincer (1978) explores in 
some detail the relationship between labor-force par- 
ticipation and geographical mobility for married cou- 
ples. 

~The 1973 survey provides the most convenient 
data for analyzing seniority levels. The other large 
surveys report seniority in interval form only. 
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unobservable ways that influence salaries. 
Thus, marital status fails to provide a test of 
the monopsony model. 

Other types of questions arise from vari- 
ables of this type, too. For example, we 
might expect that the moving costs of a 
married couple would be higher if both 
spouses have jobs. Indeed, men who have 
working spouses have higher seniority than 
men whose spouses do not work, controlling 
for other factors. Furthermore, married men 
whose spouses work earn less than married 
men whose spouses do not work, also as 
predicted by the monopsony model. On the 
other hand, family-labor-supply models pre- 
dict that wives are less likely to work as the 
income of their husband increases. If the 
labor participation of the wife is dependent 
on the income of the husband, then one 
cannot test the monopsony theory with this 
variable either. Convincing tests of the 
monopsony heterogeneity model require 
better data than are available in the surveys 
analyzed in this study. 

Why do we observe this unexpected rela- 
tionship between seniority and pay in uni- 
versity labor markets but not in other labor 
markets? The fact that academic human 
capital is very general might contribute to 
this finding. However, even in occupations 
such as school teaching and accounting, 
where firm-specific human capital would 
seem to be unimportant, we see large posi- 
tive returns to seniority. Furthermore, stud- 
ies usually find that heterogeneity biases the 
returns to seniority upward, as most popular 
theories predict, not downward. There is 
something peculiar about the institutions of 
the academic labor market that make the 
monopsony heterogeneity more important 
than in other markets. One contributing 
factor is that distances between universities 
are relatively large. Except for the largest 
metropolitan areas, universities are thinly 
distributed geographically, making moving 
costs higher for professors than for other 
occupations. Another aspect of the aca- 
demic labor market that might increase the 
importance of monopsony is the individual- 
ized nature of pay. For example, Debra A. 
Barbezat (1989) found positive returns to 
seniority at colleges represented under 
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collective-bargaining agreements, while re- 
turns were negative for employees of other 
institutions. 


V. Summary 


Nationwide data from large research- 
oriented universities show a negative rela- 
tionship between seniority and salary of 
professors, even after controlling for indi- 
viduals’ publication records, education, and 
experience. Salary data collected at three 
large universities also exhibit a negative re- 
turn of seniority on salaries. The national 
data suggest that salaries fall by as much as 
0.5 percent per year of seniority. Employ- 
ment data from individual universities sug- 
gest that the decline may be about | percent 
per year of seniority. A professor of average 
seniority could increase his or her salary by 
5-10 percent by moving to a different insti- 
tution. This is dramatically different than 
the 25-percent decline that Topel (1991) 
estimates for the typical American worker. 

These findings contradict widely accepted 
theories of wage determination but are con- 
sistent with a heterogeneity model derived 
from monopsonistic salary discrimination by 
universities. Individuals with high moving 
costs receive lower salary offers and have 
higher seniority than individuals with low 
moving costs. Differences in publication ac- 
tivity between high-seniority and low-senior- 
ity professors are unable to explain the neg- 
ative returns to seniority, as some other 
heterogeneity models suggest. 
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The Decline in Black Teenage Labor-Force Participation 
in the South, 1900-1970: The Role of Schooling 


By Ropert A. Marco AND T. ALDRICH FINEGAN* 


In 1950, approximately 61 percent of 
Southern black males aged 16-19 were par- 
ticipating in the labor force. By 1970 their 
participation rate had declined to 34 per- 
cent, or by 27 percentage points in two 
decades. Perhaps the most widely accepted 
explanation of the post-1950 decline in black 
teen participation emphasizes shifts in labor 
demand (John Cogan, 1982; Gavin Wright, 
1986; see also Richard Day, 1967). Between 
1950 and 1960 the introduction of the me- 
chanical harvester in cotton agriculture sub- 
stantially reduced the demand for the labor 
of black male teenagers, who were concen- 
trated in cotton production. Black teens dis- 
placed from Southern cotton farms did not 
find ready work in the South’s nonfarm 
economy. Their failure to find nonfarm jobs, 
according to Cogan, was caused by in- 
creases in the coverage of the federal mini- 
mum wage in the 1960’s. The combination 
of cotton mechanization and the minimum 
wage drastically reduced employment op- 
portunities for black male teens in the 
South, resulting in the sharp decline in their 
labor-force participation between 1950 and 
1970. 

We argue that the post-1950 effects of 
cotton mechanization and the minimum 
wage have been overrated. A portion of the 
post-1950 decline in black teen participation 
was the continuation of a downward trend 


*Department of Economics, Vanderbilt University, 
Nashville, TN, 37235. We are grateful to Lee Alston, 
Jeremy Atack, Stanley Engerman, Keith Inlanfeldt, 
Thomas Kane, Frank Sloan, Gavin Wright, and semi- 
nar participants at Georgia State University, the Uni- 
versity of Illinois, the National Bureau of Economic 
Research, Northwestern University, the University of 
Wisconsin, and the ASSA-Cliometrics Society meetings 
for helpful comments. 
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whose origins can be dated at least to the 
early 1900’s. The secular downward trend in 
participation was the mirror image of a 
long-term upward trend in black school en- 
rollment. Because the factors producing the 
long-run increase in schooling had not yet 
run their course by mid-century, a decline 
in black teen participation in the South 
between 1950 and 1970 would have hap- 
pened anyway, even if cotton mechanization 
and changes in the level and coverage of the 
minimum wage had been delayed until the 
1970’s. 


I. Estimates of Labor-Force 
Participation Rates 


Census data provide the basis for estimat- 
ing pre-1950 labor-force participation rates 
of black male teens (ages 16-19) in the 
South. Our estimates are shown in Figure 1, 
along with post-1950 participation rates.' In 
1900, 86 percent were participating in the 
labor force. Their participation rate fell by 
10 percentage points between 1900 and 
1920, and by an additional 15 percentage 
points between 1920 and 1950—or an aver- 
age rate of decline of almost 5 percentage 
points per decade between 1900 and 1950. 
Fully 48 percent of the decline in participa- 
tion between 1900 and 1970 took place be- 
fore 1950. 

Cogan (1982) attributed the post-1950 de- 
cline in black teen participation to the nega- 
tive effects of mechanization on agricultural 
labor demand, primarily in cotton agricul- 
ture, and to the effects of the minimum 


'We do not estimate an overall participation rate 
for 1910 because the census is thought to have over- 
counted participation among farm children in that year 
(John Durand, 1948). 
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MARGO AND FINEGAN: BLACK LABOR-FORCE PARTICIPATION 
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FiGure 1. LABor-ForcE PARTICIPATION RATES OF SOUTHERN BLACK 
MA AGED 16-19 


Note: The points in the graph are: 1900, 85.6 percent; 1920, 75.7 percent; 1930, 73.9 
percent; 1940, 63.9 percent; 1950, 60.8 percent; 1960, 42.2 percent; 1970, 33.8 percent. 


Source: See Data Appendix. 


wage on labor demand off the farm.” Be- 
tween 1950 and 1970 the number of man- 
hours required to produce a bale of cotton 
declined sharply after the widespread diffu- 
sion of the mechanical cotton picker. Be- 
cause cotton production remained constant 
over the period, the decline in labor re- 
quirements translated into a one-for-one re- 
duction in labor demand. Cogan argued that 
virtually the entire decline in employment 
of black teens in the South between 1950 
and 1970 could be explained by the decline 
in their employment in agriculture. Al- 
though some black teens found jobs in man- 


2 

“Cogan analyzed the decline in the employment-to- 
population ratio, not the participation rate. This ratio 
fell by 27 percentage points between 1950 and 1970, 
the same as the decline in the participation rate (Cogan, 
1982 p. 623). We focus on participation rates because 
employment ratios cannot be constructed for census 
years prior to 1930. 


ufacturing, growth in their employment in 
retail trade slowed considerably in the 
1960’s, precisely when coverage of the mini- 
mum wage was extended to the industry. 
Whatever the merits of Cogan’s argu- 
ment, Figure 1 demonstrates that participa- 
tion rates of black male teens were al- 
ready falling between 1900 and 1950. Had 
post-1950’s participation rates continued to 
decline at the average decadal rate experi- 
enced between 1900 and 1950 (4.96 percent- 
age points), the predicted participation rate 
in 1970 would have been 51 percent. The 
difference between this predicted 1970 rate 
and the actual 1950 rate accounts for 37 
percent ( = 9.9/27) of the decline in partici- 
pation between 1950 and 1970. If the pre- 
dicted 1970 rate is calculated using the av- 
erage (decadal) rate of decline between 1930 
and 1950, the percentage explained is 49 
percent. The decline in black teen partici- 
pation accelerated after 1950, but a post- 
1950 reduction in participation could have 


| 
.| 
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been predicted on the basis of the pre-1950 
trend. Indeed, such a prediction was made 
in 1948.3 

Although the problems of designing a re- 
liable mechanical cotton picker had largely 
been solved by the late 1920’s, marketing of 
the machine was delayed until the early 
1940’s, and the picker did not diffuse until 
after 1950 (Cogan, 1982 p. 627; Warren 
Whatley, 1987 pp. 66-7). Cotton mecha- 
nization, therefore, cannot explain the pre- 
1950 trend. Might some other type of agri- 
cultural mechanization or the minimum 
wage account for the pre-1950 decline in 
participation? 

Consider the decline in participation be- 
fore 1930. The minimum wage cannot possi- 
bly explain the pre-1930 decline in partici- 
pation because minimum-wage legislation 
was first enacted in 1938.4 The principal 
type of mechanization occurring in Ameri- 
can agriculture prior to World War II was 
the diffusion of tractors (Moses Musoke, 
1981). The diffusion of tractors was far 
slower in the South than elsewhere in the 
country. In 1930, only 4 percent of Southern 
farms used tractors (U.S. Bureau of the 
Census, 1952a p. 226). Yet the participation 
rate of black teens fell by 12 percentage 
points between 1900 and 1920, before virtu- 
ally any tractors had been introduced into 
Southern agriculture (U.S. Bureau of the 
Census, 1952a p. 226). 

Tractorization or the minimum wage, 
however, might account for some of the 
decline in participation between 1930 and 
1950. The proportion of Southern farms us- 
ing tractors rose to 26 percent by mid- 
century (U.S. Bureau of the Census, 1952a 
p. 226). Although tractors did not substan- 
tially reduce labor requirements in cotton 


3Durand (1948 pp. 66, 254) predicted a decline of 
7.6-percentage points in the participation rate of non- 
white males aged 14-19 between 1950 and 1960. 

“Wright (1986) argued that the National Industrial 
Recovery Act (NIRA) was the first federal minimum- 
wage legislation. Our argument about the pre-1930 
decline in participation is unaffected, however, because 
the NIRA was passed in 1933. 
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compared with the mechanical cotton 
picker, tractorization was associated with a 
decrease in sharecropping and other forms 
of tenancy and with a redistribution of cot- 
ton production to the Southwest and Cali- 
fornia (Musoke, 1981; Whatley, 1983; Lee 
Alston, 1985). Because tenant labor was fre- 
quently supplied in family groups, the de- 
cline in tenancy may have reduced the agri- 
cultural demand for black teen labor, as 
might the spatial reallocation of cotton pro- 
duction. Estimates of industry coverage rates 
of the minimum wage indicate that about 20 
percent of employed Southern black teens 
were in covered employment in 1950.° 

To evaluate the pre-1950 impact of trac- 
torization and the minimum wage, we esti- 
mated a regression using data for 16 
Southern states from 1930 and 1950. The 
dependent variable is the change in the 
black teen participation rate (d LFPR) be- 
tween 1930 and 1950. The independent 
variables are the change in proportion of 
farms using tractors (dTRACTORS; from 
U.S. Bureau of the Census, 1952a p. 226) 
and the coverage rate of the minimum wage 
in 1950 (dCOVERAGE). Observations were 
weighted by the average of population 
counts in 1930 and 1950. The results are: 


dLFPR = —0.105 — 0.042 dTRACTORS 
(1.356) (0.265) 


—0.014 dCOVERAGE 
(0.043) 


where absolute values of f statistics are 
shown in parentheses. The regression com- 
pletely fails to explain the variance in the 
dependent variable (R* = —0.16), although 
the coefficients of the tractorization and 


>See the Data Appendix on the construction of the 
coverage rates. The average coverage rate was low 
because most jobs held by black teens were in agricul- 
ture or the service sector, and most jobs in these 
industries were exempt from the minimum wage in 
1950. 
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coverage variables are both negative.° The 
decline in participation between 1930 and 
1950 was 13.1 percentage points (Fig. 1). 
Treating the coefficients as the best point 
estimates of their true effects, tractorization 
and the minimum wage can explain about 
20 percent [= 1-—(0.105/0.131)] of the 
1930-1950 decline in participation.’ Varia- 
tions across states in the rate of decline 
were essentially unrelated to the dif- 
fusion of tractors or the extent of coverage 
of the minimum wage in 1950. 

The regression and Figure | might be 
criticized, however, because of alterations in 
labor-force statistics over the period. In 1940 
the definition of the labor force changed to 
the labor-force-week concept. As Durand 
(1948) pointed out, the pre-1940 participa- 
tion rates are not, strictly speaking, compa- 
rable with the post-1940 estimates (see also 
Stanley Lebergott, 1964). Two responses to 
this criticism can be made. First, changes in 


°We reestimated the regression with the coverage 
rate defined to be the percentage in covered employ- 
ment multiplied by the ratio of the minimum wage in 
1950 to the average hourly wage in manufacturing in 
the state. The substantive results were unchanged. 

"The impact of the minimum wage may be under- 
stated because we are examining participation and not 
employment. The minimum wage can increase partici- 
pation through the added-worker effect, while sim- 
ultaneously reducing employment. Although it is not 
possible to measure the change in the employment-to- 
population ratio for Southern black males aged 16-19 
between 1930 and 1950, it is possible to calculate (from 
U.S. Bureau of the Census, 1931) a proxy for the 
unemployment rate of 15-19-year-olds in 1930 (the 
number of unemployed in the census week divided by 
the count of gainful workers). Since the unemployment 
rate in 1950 of 15—19-year-olds is known (U.S. Bureau 
of the Census, 1953), we can estimate the decline in 
the employment-to-population ratio for 15—19-year- 
olds and compare it with the decline in participation. 
The employment-to-population ratio fell 14.9 percent- 
age points between 1930 and 1950 for 15—19-year-olds, 
compared with a decline in participation of 13.1 per- 
centage points. The decline in employment was only 
slightly larger than the decline in participation, consis- 
tent with our finding that the effect of the minimum 
wage prior to 1950 was small. 
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the definition of the labor force have little 
bearing on the finding of a pre-1940 decline 
in participation, because the gainful-worker 
definition was used in 1900 and 1930.5 Sec- 
ond, Durand (1948) calculated adjustment 
ratios in an attempt to make the pre- and 
post-1940 participation rates comparable. 
The force of Durand’s adjustments was to 
reduce the size of the labor force prior to 
1940. Had the 1940 census question been 
asked in 1900 and 1930, participation rates 
in both years would have been lower. 

Data limitations prevented Durand (and 
us) from estimating decade-specific adjust- 
ment ratios for Southern black male teens, 
but Durand did estimate an adjustment ra- 
tio for all males aged 14-19 in 1930: 0.9756 
(Durand, 1948 p. 199). Multiplying our esti- 
mate of the 1930 participation rate for 
Southern black male teens by Durand’s ad- 


justment ratio produces an adjusted partici- 


pation rate of 72.1 percent, compared with 
the measured participation rate of 73.9 per- 
cent. Even if we assume an adjustment ratio 
equal to the lowest ratio estimated by 
Durand for any group of males (0.8953 for 
males aged 75 and over), there still was a 
decline in black teen participation between 
1930 and 1950 (Durand, 1948 p. 199).? It is 
unlikely that adjusting for definitional 
changes would alter the finding of a decline 
in black teen participation in the South 
before 1950. 


“It might be argued that the measured decline in 
participation between 1900 and 1920 was overstated 
because the 1920 census was taken in January, a sea- 
sonal bottom in teen employment, while the 1900 cen- 
sus was taken in June, a seasonal peak. Durand (1948 
p. 194), however, argued that “the proportion of youth- 
ful seasonal workers reported as having gainful occupa- 
tions was not very much affected by their employment 
status at the time the census was taken.” In addition, 
Paul Osterman (1980 p. 174) claimed that instructions 
to census enumerators caused some youth employment 
to be missed in 1900, which would understate the 
measured decline in black teen participation between 
1900 and 1920. 

Applying the adjustment ratio for males aged 75 
and over, the estimated 1930 participation rate is 66.1 
percent. 5 percentage points higher than the 1950 rate. 
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TaBLE 1—CIvitiAN LAaBor-FoRCE PARTICIPATION OF SOUTHERN BLACK MALE TEENS, 
AGED 16-19: By SCHOOL ENROLLMENT Status, 1900-1970 


Percentage in —— Percentage in labor force 
Year farm sector in schooi Dropouts In school Total 
Farm: 
1900 59.5 seas 90.2 66.7 86.0 
1930? 29.2 88.5 55.1 78.7 
1950 40.1 38.4 89.6 46.3 72.8 
1960 18.2 57.3 73.6 26.1 46.4 
1970 4.7 70.9 66.4 12.2 28.0 
Nonfarm: 
1900° 11.9 92.8 44.0 86.9 
1950 45.5 80.1 22.0 S27 
1960 60.3 68.9 23.3 41.3 
1970 66.0 61.8 20.9 34.1 


Notes: Civilian labor-force participation rates for all black male teens shown in the 
“total” column were calculated from regional data; rates for enrollment subsets were 
calculated from state data with some missing observations (see the Data Appendix). 
Hence, the total rates differ slightly from weighted averages of the within-enrollment- 
status rates. 

Sources: 1900, authors’ calculations from 1900 and 1910 census public-use samples, see 
note b below; 1930, U.S. Bureau of the Census (1935 p. 226); 1950, 1960, and 1970, see 
the Data Appendix. 

*Participation rates for the farm population by enrollment status are available for 
the age group 16-19 for the entire United States, but not the South. However, 98 
percent of the rural black farm population resided in the South in 1930 (U.S. Bureau 
of the Census, 1935 p. 51). The fraction of Southern black male teens in 1930 in the 
rural farm sector cannot be calculated from published census data. 

Nonfarm 1900 combines 1900 and 1910 public-use samples (see Population Stud- 
ies Center, 1989) because sample size in 1900 is too small to permit reliable disaggre- 
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gation by enrollment status at the regional level. 


II. Labor-Force Participation and 
Schooling: The Impact of Age-in-Grade 
Retardation 


If the minimum wage, tractorization, or 
biases in census data cannot explain the 
majority of the pre-1950 decline in black 
teen participation, which factors can? Some 
clues are suggested by Table 1, which gives 
participation rates by farm-nonfarm and 
school enrollment status, along with the 
percentage on farms and the percentage 
enrolled in school. 

At the turn of the century over 90 percent 
of Southern black male teen “dropouts” 
(i.e., those not enrolled in school) were par- 
ticipating in the labor force. Participation 
rates of in-school teens were lower than 
participation rates of dropouts but were still 
relatively high by comparison with the rates 


experienced after World War II. Despite a 
lower participation rate among in-school 
nonfarm teens, the overall participation rate 
did not differ between the farm and non- 
farm population, because the school enroll- 
ment rate was actually higher among farm 
teens. 

In 1950, the participation rate of farm 
teen dropouts was approximately the same 
as in 1900. The participation rate of in- 
school farm teens, however, declined by 20 
percentage points, and the proportion in 
school increased by 21 percentage points 
between 1900 and 1950. Among nonfarm 
teens, the percentage enrolled in school in- 
creased by 34 percentage points, and the 
participation rate of in-school black teens 
declined by 22 percentage points between 
1900 and 1950. Unlike the farm population, 
however, the participation rate of nonfarm 
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TABLE 2—HyPoTHETICAL PARTICIPATION RATES: SOUTHERN BLACK MALE TEENS 


Rate 


Participation rate (percentage) 
Total 


Farm Nonfarm 


A. Hypothetical rates for 1900 given: 


1950 enroliment rates 
(Percentage explained) 


1950 enrollment rates and 
1950 in-school labor-force 
participation rates 

(Percentage explained) 


1950 not-in-school labor-force 
participation rates 
(Percentage explained) 


. Hypothetical rates for 1950 given: 


1970 enrollment rates 
(Percentage explained) 


1970 enrollment rates and 
1970 in-school labor-force 
participation rates 

(Percentage explained) 


1970 not-in-school labor-force 
participation rates 
(Percentage explained) 


38.7 
(87.4) 


50.6 
(42.0) 


Notes: Figures give counterfactual labor-force participation rates under various as- 
sumptions. For example, row 2 of panel A calculates participation rates in 1900 
assuming 1950 school enrollment rates, 1950 in-school labor-force participation rates, 
and no other changes between 1900 and 1950. The numbers in parentheses are the 
percentages of decline in actual labor-force participation rates between 1900 and 1950 
or between 1950 and 1970 explained by the difference between the actual and 
counterfactual labor-force participation rates. 


Source: Calculated from Table 1. 


teen dropouts decreased by almost 13 per- 
centage points. Nonfarm teens experienced 
a decline in participation over 2.5 times as 
large as the decline in participation among 
farm teens because of the larger rise in 
nonfarm school enrollment and the fall in 
participation among nonfarm teen dropouts. 

School enrollment rates continued to 
climb between 1950 and 1970. Among farm 
teens in school, participation rates fell by 34 
percentage points between 1950 and 1970, 
but the participation rate of nonfarm teens 
enrolled in school remained stable at slightly 
over 20 percent. In contrast to the pre-1950 
period, participation rates of farm teen 
dropouts fell by 23 percentage points be- 


tween 1950 and 1970. Participation rates of 
nonfarm teen dropouts continued to fall as 
they had before 1950, but at a faster rate, 
with most of the decline concentrated in the 
1950’s. Thus, reversing the patterns for the 
pre-1950 era, from 1950 to 1970 the overall 
participation rate for farm teens fell more 
than twice as much as the rate for nonfarm 
teens.!° 


‘See Robert Mare and Christopher Winship (1984) 
for a detailed analysis of post-1960 interactions be- 
tween black school attendance and labor-force partici- 
pation, based on CPS data. 


239 
- 
81.2 70.6 76.9 
73.3 60.6 68.2 
(73.4) 
85.5 76.4 81.8 
(18.5) 
58.9 42.1 48.8 
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Table 1 suggests that increasing rates of 
school enroliment and reduced labor-force 
activity among black teens in school were 
proximate causes of the pre-1950 decline in 
participation. The hypothesis is confirmed 
in Table 2, which computes participation 
rates for 1900 and 1950 under various as- 
sumptions about school enrollment rates 
and participation conditional on enrollment 
status. Had enrollment rates (farm and non- 
farm) in 1900 been equal to their 1950 val- 
ues, the overall participation rate in 1900 
would have been 77 percent. Had enroll- 
ment rates and participation rates of in- 
school black teens in 1900 equaled their 
1950 values, the overall participation rate 
would have been 68.2 percent. The differ- 
ence between the actual 1900 participation 
rate and the latter counterfactual rate ac- 
counts for approximately three-quarters of 
the decline in participation between 1900 
and 1950. Changes in participation among 
dropouts were less important, explaining 
about 18 percent of the decrease in partici- 
pation before 1950. 

Increasing school enrollments and de- 
creasing participation rates of in-school 
teens continued to be important after 1950, 
but the significance of changes in participa- 
tion among dropouts was greater after 1950 
than before. Because nonfarm participation 
rates were lower than farm participation 
rates, the shift of the population out of 
agriculture between 1950 and 1970 (35 per- 
centage points) also helped to lower the 
overall participation rate.'! 

The sharp decline after 1950 in participa- 
tion among farm teen dropouts is consistent 
with Cogan’s (1982) argument that cotton 
mechanization reduced employment oppor- 
tunities for black teens in Southern agricul- 
ture. The post-1950 shift of black labor out 
of agriculture, the increase in school enroll- 
ment, and the decrease in participation 


'l Interaction effects were also important after 1950. 
The fraction of the post-1950 decline in the overall 
participation rate attributed to increased enrollments 
and lower in-school participation rates (87 percent), 
when added to the fraction attributed to lower partici- 
pation of dropouts (42 percent), exceeds 100 percent. 
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among in-school teens might also be at- 
tributed to cotton mechanization, except 
that downward trends in these variables be- 
gan long before 1950.'? What needs to be 
explained are changes in school enrollments 
and the high participation rate of black teens 
in school prior to 1950. 

We hypothesize that changes before 1950 
in school enrollment rates and in participa- 
tion rates of black teens in school were 
generated, in part, by rising expectations of 
black parents that their children complete 
more years of schooling before entering the 
labor force permanently. At the turn of the 
century black parents in the South were 
poor, poorly educated, and heavily rural. 
Poverty, adult illiteracy, the perception that 
the returns to schooling were low in agricul- 
ture, the value of teen labor on family farms, 
and poor-quality schools conspired to keep 
the typical black child out of school at later 
ages. In 1910, for example, the peak age of 
black school enrollment in the South was 
age 11 (71 percent), and enrollment rates 
fell off sharply at age 14 (U.S. Bureau of the 
Census, 1918 p. 384). The upshot was that 
Southern blacks educated in the late 19th 
and early 20th centuries finished very few 
years of schooling before entering the labor 
force: an average of about five years for the 
birth cohorts from 1886 to 1905 (James 
Smith, 1986 p. 1227).!° 

That the number of years of schooling 
desired was so low can account for the low 
enrollment rate of black teens in 1900, but 
it cannot explain why the labor-force partic- 
ipation of in-school black teens was so high 
in the early 20th century. A child whose 


'2In addition, if technical change in Southern agri- 
culture (other than tractorization) were solely responsi- 
ble for the pre-1950 trends in enrollments, percentage 
on farms, and participation rates of in-school farm 
teens, it is difficult to explain why the participation rate 
of farm teen dropouts did not decline until after 1950. 

'3Smith’s figure pertains to the Southern black pop- 
ulation. For black males it is biased upward because 
black female children were more likely to attend school 
than black male children around the turn of the cen- 
tury; it is biased upward more generally because the 
census overstated black educational attainment at the 
turn of the century (Margo, 1990 Ch. 2). 
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education stopped at the sixth grade could, 
in theory, finish his schooling by age 12 or 
13 and then enter the labor force. In fact, a 
significant fraction of Southern black chil- 
dren suffered from high rates of age-in-grade 
retardation and simply did not complete a 
given number of years of schooling in the 
customary amount of time (Finis Welch, 
1973). According to the 1940 census, for 
example, 47 percent of Southern black males 
enrolled in school at age 12 were attending 
the fourth grade or a lower grade (U.S. 
Bureau of the Census, 1943a p. 126). Al- 
though age-in-grade retardation in the pre- 
teen years had many causes, there is little 
doubt that a major factor was the poor 
quality of elementary schools attended by 
Southern black children (Horace Mann 
Bond, 1934 pp. 344-56; Finis Welch, 1973). 

As long as the parental demand for child’s 
schooling was low, age-in-grade retardation 
at an early age was not a binding constraint. 
Completion of a given number of years of 
schooling could be spread over several years 
and be readily combined with labor-force 
participation at later ages. Once desired 
years of schooling reached modern norms, 
however, enrollment rates at later ages in- 
creased, and participation of in-school teens 
fell. A black teen struggling to finish his 
schooling with the sixth grade could work 
and still expect to achieve his goal before 
leaving home. However, a black teen whose 
parents wished him to complete high school 
would, under normal circumstances, gradu- 
ate at age 18. He could ill afford to fall 
several years behind, at an early or a later 
age, and his parents could best accommo- 
date their wishes by keeping him out of the 
labor force as much as possible during his 
teenage years. 

Over time, more black parents sought to 
have their children complete higher levels 
of schooling, while improvements in the 
quality of black elementary schools, particu- 
larly the length of the school year, lessened 
the extent of early age-in-grade retarda- 
tion.'* The reasons for the increased 


'S Using state-level data for Southern states in 1940 
and 1950, we estimated a regression of the change in 
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parental demand are not fully understood, 
but reductions in adult illiteracy and pov- 
erty clearly played important roles.!> 


Cross-sectional analysis of data from the 
early-20th-century South reveals significant 
positive effects of adult literacy and the 


the retardation rate at age 12 (the percentage of en- 
rolled black 12-year olds attending the fourth grade or 
some lower grade) between 1940 and 1950. The inde- 
pendent variables were the change in the length of the 
school year in days, the change in the pupil-to-teacher 
ratio, and the change in the average daily attendance 
rate. Delaware was omitted because the retardation 
rate was not available for 1950. Data were weighted by 
the population counts (enrolled 12-year-olds) in each 
state. The retardation rate was calculated from U.S. 
Bureau of the Census (1943b) and U.S. Bureau of the 
Census (1952b). The sources for the length of the 
school year and the pupil-to-teacher ratio were Federal 
Security Agency (1947) and U.S. Department of Health, 
Education, and Welfare (1954). The coefficient for the 
length of the school year was —0.0029 (¢ statistic = 
— 1.85). The coefficients of the pupil-to-teacher ratio 
and the daily attendance rate had the wrong signs 
(negative and positive, respectively), but neither was 
statistically significant. The retardation rate at age 12 
fell by 6.9 percentage points between 1940 and 1950, 
while the average length of the school year rose by 17.4 
days. Based on the regression, we estimate that 73 
percent (= —0.0029 x 17.4/0.069) of the decrease in 
the retardation rate between 1940 and 1950 is ex- 
plained by the increase in the length of the school year. 

‘Three other factors are worth noting. A significant 
part of the returns to schooling for Southern blacks 
was migration to the North, where real wages were 
higher. Better-educated blacks were more likely to 
leave the South than less-educated blacks (Margo, 1990 
Ch. 7). Migration to the North became more feasible 
with the establishment of black communities in North- 
ern cities (Wright, 1986), and awareness of the eco- 
nomic benefits of migration may have prompted some 
black parents to keep their children in school at later 
ages. Second, violations of the separate-but-equal doc- 
trine were much worse at the high-school level than at 
the elementary level in the Scuth in the early 20th 
century. Few black teenagers in the rural South in 1910 
could have attended a local public high school even if 
they wanted to, because the number of such schools 
was extremely small (Margo, 1990 p. 20). Provision of 
black high schools became more common, however, 
after World War I. Third, there is evidence of “‘catch- 
up” behavior among Southern black parents; that is, 
holding constant family background, school quality, 
and local labor-market characteristics, black parents 
were more likely to send their children to school than 
were white parents (Margo, 1990 pp. 79-80). 
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TABLE 3—ENROLLMENT AND LABOR-FORCE PARTICIPATION REGRESSIONS: 
SOUTHERN BLACK MALE TEENS, 1950 
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Enrollment rate 


Labor-force participation 
of in-school teens 


Independent 
variable WLS WLS 2SLS WLS WLS 2SLS 
Constant 0.508 0.574 0.580 0.239 0.078 0.066 
(23.989) (13.156) (10.558) (8.372) (1.592) (1.538) 
Elementary —0.277 —0.298 0.662 0.713 
enrollment ratio (-— 1.739) (-1.421) (3.692) (2.827) 
Rural nonfarm —0.094 -0.050 —-—0.047 0.072 —-0.111 


(—5.337) (— 1.667) (—1.268) (0.030) (—3.043) (—2.489) 


Farm —0.103  -0.024 -0.018 0.227 0.038 0.023 
(—6.728) (—0.495) (—0.289) (10.989) (0.703) (0.316) 
Mean value of 
dependent variable: 0.43 0.33 
R?: 0.77 0.78 0.78 0.91 0.91 0.90 
N: 40 40 


Notes: WLS denotes weighted least squares; 2SLS denotes weighted two-stage least 
squares. Numbers in parentheses are f statistics. 


Source: See Data Appendix and text. 


household head’s occupational and home- 
ownership status (proxies for income and 
wealth) on the school attendance of black 
children (Margo, 1990 pp. 13, 77).'° 

The regressions in Table 3 provide a test 
of our hypothesis. The data are within-state 
aggregates for farm, rural nonfarm, and ur- 
ban black teens in the South in 1950. The 
dependent variables are the school enroll- 
ment rate and the labor-force participation 
rate of in-school black teens. The key inde- 
pendent variable is the elementary enroll- 
ment ratio: the percentage of total school 
enrollment among black male teens ac- 
counted for by the elementary grades. 
Dummy variables are included for states 
and for rural nonfarm and urban status to 
control for a variety of factors that may 
have influenced participation and school en- 
rollment but which cannot be measured di- 


'©Compulsory-schooling legislation also raised black 
school attendance, but the effect was small (see Margo, 
1990 p. 13). 


rectly.'’ The coefficients in the columns la- 
beled WLS are estimated using weighted 
least squares; the weights are population 
counts. Data are drawn from the state vol- 
umes of the 1950 census (see the Data Ap- 
pendix). 

As our hypothesis predicts, the coefficient 
of the elementary enrollment ratio is nega- 
tive in the enrollment regression and posi- 
tive in the labor-force participation regres- 
sion. Every 10-percentage-point decrease in 
the elementary enrollment ratio increases 
the enrollment rate by 2.8 percentage points 
and decreases the labor-force participation 
rate of in-school teens by 6.6 percentage 
points. Controlling for the elementary ratio 
substantially accounts for the large differ- 
ences in labor-force participation and school 
enrollment between the farm, rural non- 
farm, and urban populations evident in the 
regressions excluding this variable. 


"The inclusion of state and within-state sectoral 
dummies is analogous to a fixed-effects regression. In 
our application the fixed effects are states, and the 
within-state dummies correspond to time dummies. 
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An objection to the weighted-least- 
squares estimates is that the elementary en- 
rollment ratio might be correlated with the 
error term. The number of teens enrolled in 
school is the denominator of the elementary 
enrollment ratio and also the numerator of 
the enrollment rate and the denominator of 
the labor-force participation rate. Measure- 
ment error in enrollments would cause a 
spurious negative correlation between the 
enrollment rate and the elementary enroll- 
ment ratio and a spurious positive correla- 
tion between the labor-force participation 
rate and the elementary enrollment ratio. 
Because the elementary enrollment ratio 
was constructed from the state volumes of 
the 1950 census, which drew upon a 20- 
percent random sample of the population, 
there is reason to suspect measurement 
error. 

The regressions labeled 2SLS in Table 3 
were estimated using weighted two-stage 
least squares. The instruments for the ele- 
mentary enrollment ratio were the median 
years of schooling of males aged 45-54, 
median household income (education of 
adult males and household income are prox- 
ies for parental demand), the retardation 
rate at age 10 (the fraction of 10-year-olds 
enrolled in the third grade or lower, a proxy 
for school quality), and the state and sec- 
toral dummies. Although the significance 
level of the elementary enrollment ratio in 
the enrollment regression decreases, the 
two-stage coefficients have the same signs 
and are slightly larger than the weighted- 
least-squares coefficients.'* We conclude 
that our substantive findings are unchanged, 


'SWe also experimented with different combinations 
of instruments, including the median years of schooling 
of females aged 45-54 (in addition to, or instead of, 
median household income) and the retardation rate at 
age 9 (instead of the retardation rate at age 10). The 
significance level of the two-stage coefficient of the 
elementary ratio in the enrollment regression was sen- 
sitive to the choice of instruments, but the sign of the 
coefficient was always negative, and the magnitude of 
the coefficient was always close to the value reported in 
Table 3. The significance level, sign, and magnitude of 
the coefficient of the elementary ratio in the participa- 
tion regression were robust to the choice of instru- 
ments. 
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allowing for the possible endogeneity of the 
elementary enrollment ratio. 

We use the weighted-least-squares re- 
gression coefficients to compute counterfac- 
tual enrollment and participation rates 
based on different assumptions about the 
value of the elementary enrollment ratio. In 
1950, the value of the elementary ratio for 
Southern black male teens was 0.376 (U.S. 
Bureau of the Census, 1953 p. 350). Had the 
ratio in 1950 been equal to its value in 1970 
(8.5 percent; calculated from U.S. Bureau 
of the Census [1973a p. 1116]) the estimated 
participation rate would have been 46.1 per- 
cent.'? The difference between the actual 
and counterfactual 1950 participation rates 
accounts for 54 percent (= 14.7/27) of the 
decline in black teen participation between 
1950 and 1970. 

In 1940, the elementary enrollment ratio 
for Southern black male teens was 51.2 per- 
cent (U.S. Bureau of the Census, 1943a 
p. 127). Comparable figures for the pre- 
1940 period are not available, but it is possi- 
ble to estimate the ratio from information 
on the age structure of enrollments and 
the number of pupils enrolled in high 
school. Using these data we estimate that 
the ratio was 58.9 percent in 1930.7? Thus 
the percentage of black teens enrolled in 


Based on the weighted-least-squares coefficients 
in Table 3, we calculate the average value of the 
enrollment and labor-force participation rates for farm, 
rural nonfarm, and urban sectors, setting the elemen- 
tary enrollment ratio equal to 0.085 (the value for 
1970). We assume that the sector-specific labor-force 
participation rates for black teen dropouts equal their 
actual 1950 values. Using the 1950 proportions of black 
teens in each sector as weights, we calculate the overall 
participation rate. 

~’ We calculated the elementary enrollment ratio in 
1930 by subtracting the number of Southern black 
pupils enrolled in high school (from U.S. Department 
of the Interior, 1932) from the total number of South- 
ern blacks, aged 16-19, attending school in 1930 (from 
U.S. Bureau of the Census, 1935 p. 213). The estimate 
is biased downward for males because the data on 
enrollments by grade are not gender-specific, and black 
female children were more likely to attend school than 
black male children. The ratio is also biased downward 
because we assume that no 15-year-olds were enrolled 
in high school, when some were. The latter bias is 
offset, however, by the implicit assumption that no 
19-year-olds were enrolled in college. 


the high-school grades had been increasing 
for at least two decades prior to 1950, pro- 
ducing a decline in labor-force participation 
that could have been expected to continue 
after 1950.7! 

Racial and regional differences in black 
teen participation can also be partly at- 
tributed to differences in the elementary 
enrollment ratio. The ratio for Southern 
white male teens was 11.8 percent in 1950 
(U.S. Bureau of the Census, 1953 p. 350). 
Had there been no racial difference in the 
ratio in the South in 1950, we estimate that 
48 percent of black male teens would have 
been participating in the labor force, 7 per- 
centage points Jess than the participation 
rate of Southern white male teens in 1950. 
The fact that black teens were less likely to 
participate in the labor market than white 
teens, holding the elementary enrollment 
ratio constant, suggests that racial discrimi- 
nation may have played a role in limiting 
labor-market opportunities in the South be- 
fore 1950 (Margo, 1990 Ch. 6). 

In 1950 the elementary enrollment ratio 
for non-Southern black teens was 15.7 per- 
cent (calculated from U.S. Bureau of the 
Census [1953 pp. 345, 348, 354]). Had the 
ratio for Southern black teens equaled the 
non-Southern value, their participation rate 
is predicted to have been 50 percent. Since 
the participation rate of non-Southern black 
teens was 40.4 percent in 1950 (calculated 
from U.S. Bureau of the Census [1953 
pp. 366—73]), the regional difference in the 
elementary enrollment ratio explains 53 
percent (= 10.8/20.4) of the regional dif- 
ference in participation rates. 


III. Conclusion 


The decline in labor-force participation 
among black male teens in the South be- 


*"If the value of the elementary ratio in 1950 had 
equaled its estimated value in 1930, the predicted 
participation rate in 1950 is 69.9 percent. The decline 
in participation between 1930 and 1950 was 13.1 per- 
centage points. Thus the decrease in the elementary 
enrollment ratio between 193C and 1950 explains 70 
percent (=9.1/13.1) of the decline in participation 
between 1930 and 1950. 
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tween 1950 and 1970 has frequently been 
attributed to post-1950 effects of cotton 
mechanization and the minimum wage. We 
have shown, however, that the post-1950 
decline in participation was partly the con- 
tinuation of a secular trend. Parental de- 
mand was an important factor behind the 
secular trend: as black parents desired 
higher levels of educational attainment for 
their children than had been the norm early 
in the century, low rates of teenage school 
enrollment and high rates of labor-force 
participation among in-school teens were no 
longer feasible. 

The limitations of our argument should 
be recognized. Increased parental demand 
is not responsible for the post-1950 decline 
in participation among farm and nonfarm 
dropouts, or the pre-1950 decline in partici- 
pation among nonfarm dropouts.” Parental 
demand cannot fully account for racial and 
regional differences in participation in 1950. 
Productivity growth in the Southern econ- 
omy before 1950, including agriculture, 
raised the incomes of black parents and 
thus contributed indirectly to the secular 
trend in participation. There is compelling 
evidence that the minimum wage had severe 
disemployment effects on black teens in cer- 
tain Southern industries (e.g., sawmills) af- 
ter 1950 (Cogan, 1982). The important point, 
however, is that debate over the causes of 
the post-1950 decline in black teen partici- 
pation should no longer ignore the pre-1950 
trend and its causes. 


Data APPENDIX 


All data are available from Robert A. Margo on a 
floppy disk. 


The precise timing of the pre-1950 decline in 
participation among nonfarm dropouts cannot be de- 
termined from the published census volumes. For the 
country as a whole, the participation rate of nonfarm 
black teen dropouts was 89.6 percent in 1900 (com- 
puted from the public-use sample of the 1900 census 
[Center for Studies in Demography and Ecology, 1980]) 
and 83.1 percent in 1930 (U.S. Bureau of the Census, 
1935 p. 226). This suggests that the minimum wage 
cannot be fully responsible for the pre-1950 decline 
in participation among nonfarm dropouts, because 
some of the decline occurred before the passage of 
minimum-wage legislation. 
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Labor-Force Participation Rates.—Prior to 1940 the 
census identified persons aged 10 and over who re- 
ported a gainful occupation. Participation rates for 
1900, 1920, and 1930 are the fractions of Southern 
black males, aged 16-19, reporting a gainful occupa- 
tion. The participation rate for 1900 (Fig. 1) was calcu- 
lated from the public-use sample of the 1900 census 
(Center for Studies in Demography and Ecology, 
1980).2> Participation rates for 1920 and 1930 were 
calculated from state-level census data on population 
and occupations by age (U.S. Bureau of the Census, 
1922 pp. 189-285; 1923 pp. 446-51, 515-18; 1933 table 
9; 1935 pp. 108-9). The participation rates for 1900, 
1920, and 1930 pertain to blacks. 

Participation rates for 1940-1960 pertain to the 
labor-force activity of nonwhite male teens in the South 
during the census week. For 1970 the participation rate 
pertains to black teens exclusively. Virtually no bias is 
introduced by the distinction between nonwhites and 
blacks because blacks accounted for the overwhelming 
proportion of nonwhites in the South between 1940 
and 1960. The participation rate for 1940 was obtained 
from U.S. Bureau of the Census (1943a p. 153). The 
state volumes of the 1950-1970 censuses are suffi- 
ciently detailed to calculate participation rates by en- 
rollment status and, for most Southern states, within 
the urban, rural nonfarm, and farm sectors, which can 
then be aggregated (U.S. Bureau of the Census, 1952b 
table 71; 1963 table 117; 1973b table 166). 

The South consists of 16 states (Alabama, Arkansas, 
Delaware, Florida, Georgia, Kentucky, Louisiana, 
Maryland, Mississippi, North Carolina, Oklahoma, 
South Carolina, Tennessee, Texas, Virginia, and West 
Virginia) plus the District of Columbia. 

State-Level Minimum-Wage Coverage Rates in 
1950.—For the census week in 1950 we estimated the 
fraction of employed black teenage males, aged 16-19, 
in each Southern state who were covered by the mini- 
mum-wage provisions of the Fair Labor Standards Act. 
The published volumes of the 1950 census present data 
on the occupation distribution of nonwhite males, clas- 
sified by age. The occupational breakdowns consist of 
180 categories, including major industry subsets of op- 
eratives and laborers. A large share of black teenage 
males were employed in the small number of cate- 
gories for which industry-occupation coverage rates 
could be estimated from other sources.” 


3Because the number of nonfarm Southern black 
teens in the 1900 census sample was very small, their 
participation rates by enrollment status and the enroll- 
ment rate (see Table 1) were calculated by combining 
the 1900 and 1910 public-use samples (Population 
Studies Center, 1989). 

"Delaware is excluded from all analyses of 1950 
data because no state-level data on nonwhites were 
reported. 

These sources are: (i) table 14 of U.S. Department 
of Labor (1948), which shows total nonsupervisory em- 
ployment and the number of employees subject to the 
minimum-wage provisions of the Fair Labor Standards 
Act by industry group in September 1947; (ii) Charles 
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The first step in calculating the coverage rate was to 
use U.S. Bureau of the Census (1956a) to identify each 
occupational group in the state volumes that contained 
at least 0.2 percent of all employed nonwhite teenage 
males in 1950. Second, for occupational groups that 
were entirely (or almost entirely) employed in a single 
industry (e.g., sawyers, plasterers) we assigned the cov- 
erage ratio from U.S. Department of Labor (1948). By 
far the biggest employer of black male teens in 1950 
was agriculture, for which the coverage rate in 1950 
was zero. 

Third, for occupational groups that were employed 
in a number of industries but for which no industrial 
breakdown was given in the state volumes (e.g., bakers, 
truck drivers, janitors) we used U.S. Bureau of the 
Census (1956b) to estimate the industry—occupation 
matrix.°° Because we know the coverage rates for each 
industry (and we have allocated occupations to indus- 
try) we can estimate covered employment by occupa- 
tion. The overall coverage rate is the sum of covered 
employment across occupations divided by the number 
of employed black teens. 

Table 3.—All data are from U.S. Bureau of the 
Census (1952b). 


REFERENCES 


Alston, Lee, Costs of Contracting and the 
Decline of Tenancy in the South, New 
York: Garland, 1985. 

Bond, Horace Mann, The Education of the 
Negro in the American Social Order, New 
York: Prentice-Hall, 1934. 

Cogan, John, ““The Decline in Black Teenage 
Employment, 1950-1970,” American 
Economic Review, September 1982, 72, 


H. Livengood (1951), which identifies the categories of 
employees who were exempt from the minimum-wage 
provisions as of 1950; (iii) table 7 of U.S. Bureau of the 
Census (1956a), which cross-classifies employed non- 
white males in the United States by age and detailed 
occupation (in 1950 more than 85 percent of all em- 
ployed black teens lived in the South); (iv) table 2 of 
U.S. Bureau of the Census (1956b), which provides a 
cross-classification of all employed males (all ages and 
races combined) by detailed occupation and a partial 
breakdown of industries. 

©The industrial classifications in U.S. Bureau of the 
Census (1956b) refer to all employed males. Because 
black teens are likely to be underrepresented relative 
to all employed males in highly covered industries (e.g., 
durable-goods manufacturing) the coverage rates are 
biased upward. 


621-38. 

Day, Richard, “The Economics of Technolog- 
ical Change and the Demise of the Share- 
cropper,” American Economic Review, 
June 1967, 57, 427-49. 

Durand, John, The Labor Force in the United 
States, 1890-1960, New York: Social Sci- 
ence Research Council, 1948. 

Lebergott, Stanley, Manpower in Economic 
Growth: The American Record Since 1800, 
New York: McGraw-Hill, 1964. 

Livengood, Charles H., The Federal Wage and 
Hour Law, Reflecting the Fair Labor Stan- 
dards Amendments of 1949, Philadelphia, 
PA: American Law Institute, 1951. 

Mare, Robert D. and Winship, Christopher, 
“The Paradox of Lessening Racial In- 
equality and Joblessness Among Black 
Youth: Enrollment, Enlistment, and Em- 
ployment, 1964-1981,” American Socio- 
logical Review, February 1984, 49, 39-55. 

Margo, Robert A., Race and Schooling in the 
South, 1880-1950: An Economic History, 
Chicago: University of Chicago Press, 
1990. 

Musoke, Moses, “Mechanizing Cotton Pro- 
duction in the United States: The Tractor 
1915-1960,” Explorations in Economic 
History, October 1981, 18, 347-75. 

Osterman, Paul, Getting Started: The Youth 
Labor Market, Cambridge, MA: MIT 
Press, 1980. 

Smith, James, “Race and Human Capital: 
Reply,” American Economic Review, De- 
cember 1986, 76, 1225-9. 

Welch, Finis, “Education and Racial Dis- 
crimination,” in Orley Ashenfelter and Al 
Rees, eds., Discrimination in Labor Mar- 
kets, Princeton, NJ: Princeton University 
Press, 1973, pp. 43-81. 

Whatley, Warren, “Labor For the Picking: 
The New Deal in the South,” Journal of 
Economic History, December 1983, 43, 
905-29. 

, “Southern Agrarian Labor Con- 
tracts as Impediments to Cotton Mecha- 
nization,’ Journal of Economic History, 
March 1987, 47, 45-70. 

Wright, Gavin, Old South, New South, New 
York: Basic Books, 1986. 

Center for Studies in Demography and Ecology, 
University of Washington, United States 


246 THE AMERICAN ECONOMIC REVIEW 


MARCH 1993 


Census Data, 1900: Public Use Sample, 
Ann Arbor, MI: Inter-university Consor- 
tium for Political and Social Research, 
1980. 

Federal Security Agency, U.S. Office of Educa- 
tion, Biennial Surveys of Education in the 
United States, 1938-40 and 1940-42, 
Washington, DC: U.S. Government Print- 
ing Office, 1947. 

Population Studies Center, University of Penn- 
sylvania, Public Use Sample: 1910 Census 
of Population, Ann Arbor, MI: Inter- 
university Consortium for Political and 
Social Research, 1989. 

U.S. Bureau of the Census, Negro Population, 
1790-1915, Washington, DC: U.S. Gov- 
ernment Printing Office, 1918. 

, Fourteenth Census of the United 
States Taken in the Year 1920, Volume II. 
Population 1920, General Report and Ana- 
lytical Tables, Washington, DC: U.S. Gov- 
ernment Printing Office, 1922. 

______, Fourteenth Census of the United 
States Taken in the Year 1920, Volume IV. 
Population 1920, Occupations, Washing- 
ton, DC: U.S. Government Printing Of- 
fice, 1923. 

___, Fifteenth Census of the United States 
1930, Unemployment, Volume 1, Washing- 
ton, DC: U.S. Government Printing Of- 
fice, 1931. 

, Fifteenth Census of the United States 
1930, Population, Volume IV. Occupations, 
By States, Washington, DC: U.S. Govern- 
ment Printing Office, 1933. 

, Negroes in the United States, 
1920-1932, Washington, DC: U.S. Gov- 
ernment Printing Office, 1935. 

_____, (1943a) Sixteenth Census of the 
United States: 1940, Population, Volume 
IV. Characteristics by Age, Part 1, United 
States Summary, Washington, DC: U.S. 
Government Printing Office, 1943. 

, (1943b) Sixteenth Census of the 
United States: 1940, Population, Volume 
IV. Characteristics by Age, Parts 2, 3, and 
4, Washington, DC: U.S. Government 
Printing Office, 1943. 

, (1952a) 1950 Census of Agriculture, 
Volume II. General Report, Statistics by 
Subject, Washington, DC: U.S. Govern- 
ment Printing Office, 1952. 


VOL. 83 NO. 1 


, (1952b) Census of Population: 1950, 
Volume II. Characteristics of the Popula- 
tion, Parts 2, 3, and 4, Washington, DC: 
U.S. Government Printing Office, 1952. 

, Census of Population: 1950, Volume 
II. Characteristics of the Population, Part 
1, United States Summary, Washington, 
DC: U.S. Government Printing Office, 
1953. 

,(1956a) United States Census of Pop- 
ulation: 1950. Special Reports: Occupa- 
tional Characteristics, Washington, DC: 
U.S. Government Printing Office, 1956. 

, (1956b) United States Census of 
Population: 1950. Special Reports: Occupa- 
tion By Industry, Washington, DC: U.S. 
Government Printing Office, 1956. 

, Census of Population: 1960, Volume 
i. Characteristics of the Population, Part 2, 
Washington, DC: U.S. Government Print- 
ing Office, 1963. 


MARGO AND FINEGAN: BLACK LABOR-FORCE PARTICIPATION 247 


____, (1973a) 1970 Census of Population, 


Volume 1. Characteristics of the Popula- 
tion, Part I; United States Summary, Sec- 
tion 2, Washington, DC: U.S. Govern- 
ment Printing Office, 1973. 


_, (1973b) 1970 Census of Population, 


Volume 1. Characteristics of the Popula- 
tion, Part 2, Washington, DC: U.S. Gov- 
ernment Printing Office, 1973. 


U.S. Department of Health, Education, and Wel- 


fare, Biennial Survey of Education in the 
United States, 1948-50, Washington, DC: 
U.S. Government Printing Office, 1954. 


U.S. Department of Interior, Office of Education, 


Biennial Survey of Education in the United 
States, 1928-30, Washington, DC: US. 
Government Printing Office, 1932. 


U.S. Department of Labor, Annual Report of 


the Wage and Hour and Public Contracts 
Division, Washington, DC: U.S. Govern- 
ment Printing Office, 1948. 


Causes of Intercity Variation in Homelessness 


By MarsoriE HoniG AND RANDALL K. FILER* 


Homelessness in America has become a 
major policy concern in recent years. Fol- 
lowing estimates by a number of researchers 
in the 1980’s that suggested as many as a 
half-million homeless,' the 1990 census 
places the current number at around a 
quarter of a million. Efforts to design poli- 
cies to deal with this problem have been 
handicapped by a lack of systematic analy- 
sis of the causes of homelessness.” Policy- 
makers have had little guidance from 
researchers in determining the relative im- 
portance of such potential causes of home- 
lessness as tight housing markets, slack 
labor markets, reductions in real public- 
assistance benefits, tightening of eligibility 
requirements for public assistance, and non- 
institutionalization of the mentally ill. 

To assess the relative importance of these 
and other factors, we used estimates by the 
Department of Housing and Urban Devel- 
opment (HUD) of the homeless population 
in a cross-section of metropolitan areas in 
1984. We do not attempt in this paper to 
develop a full structural model of homeless- 


*Department of Economics, Hunter College / 
CUNY, 695 Park Avenue, New York, NY 10021. This 
research was supported by the Institute for Research 
on Poverty under the Small Grants Program and the 
University Committee on Research of the City Univer- 
sity of New York. The authors thank Steve Bartolomei- 
Hill, Howard Chernick, Steven Craig, Martha Hill, 
Charles Manski, Kathryn Nelson, and Cordelia Reimers 
for helpful comments, and Franco Pignataro for his 
unusually meticulous programming assistance. 

See Filer and Honig (1990) for a detailed discus- 
sion of estimates of the size and growth of the home- 
less population. 

“Most studies have focused on the size of the home- 
less population and its demographic characteristics. A 
few analyses have examined a limited range of poten- 
tial causes of homelessness. F. Stevens Redburn and 
Terry F. Buss (1986) examined the roles of population 
growth, climate, and housing conditions; William 
Tucker (1987) and John M. Quigley (1990) focused on 
the role of rent control. Karin Ringheim (1990) ana- 
lyzed in depth a small sample of metropolitan areas. 
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ness. Rather, we present reduced-form 
findings as a first attempt to measure com- 
prehensively and systematically the underly- 
ing causes of homelessness.* 

We assume in this paper that homeless- 
ness results from an imbalance between the 
cost of available housing and a household’s 
income. Such an imbalance may occur, for 
example, when housing markets are tight 
relative to labor markets and housing costs 
are therefore high relative to earnings (or to 
alternative resources, such as public assis- 
tance). Investigation of the causes of home- 
lessness must go beyond housing markets 
alone, however, because of the special char- 
acteristics of the population at risk and the 
public policies that address their needs. 
Transfer payments and policies regarding 
institutionalization of the mentally ill, for 
example, should be important determinants 
of the incidence of homelessness but are 
not part of a standard housing model. Since 
homelessness represents the end of a spec- 
trum of poor housing outcomes, we also 
estimate equations for two related condi- 
tions, “crowded” and “doubled-up” hous- 
ing. These are often cited as causes of 
homelessness but are, in fact, different man- 
ifestations of the same underlying relation- 
ship between housing costs and household 
resources.’ 


*We view this effort as preliminary to a more de- 
tailed analysis we intend to undertake when complete 
data from the 1990 Census, which enumerates the 
homeless population, become available. 

Most previous economic analyses of housing mar- 
kets have not focused on the decision of whether to 
share housing. Demographers, on the other hand, have 
long studied this issue when examining family forma- 
tion and living arrangements. Among economists, a 
notable exception is the study by Axel Borsch-Supan 
and John Pitkin (1988), who focused on the choice 
between renting, purchasing, and sharing housing. We 
are concerned in this paper with the other margin, 
where individuals and families chose between sharing 
housing and entering the homeless shelter system. Such 
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I. Data 


Although there has been considerable 
controversy concerning the reliability of the 
HUD data,” HUD’s estimate of approxi- 
mately 250,000-—350,000 homeless individu- 
als nationwide has been supported by sev- 
eral subsequent studies. These have used a 
variety of estimation strategies but generally 
have arrived at numbers in the range of the 
HUD estimates.° While HUD’s local area 
estimates are likely to be imprecise, it is 
sufficient for our purposes that any errors 
are random across cities. We find no evi- 
dence that this is not the case. Further- 
more, in almost every city where indepen- 
dent researchers conducted a full count of 
the homeless at the same time as the HUD 
survey, the enumeration fell within the most 
reliable range reported by HUD.’ Our 


a model was estimated using individual-level data by 


Renya Reed (1992). She suggests that most families 
prefer shared housing and enter the current homeless 
shelter system when constrained from this option. 
These questions can be further developed once the 
micro-level data from the 1990 Census are available. 

’See Redburn and Buss (1986) for a useful summary 
of the methodology and estimation issues, and see 
Richard P. Appiebaum (1987) and Lynn Parsons (1986) 
for methodological critiques. 

°HUD used a mid-range based on four independent 
methods of arriving at a national estimate of between 
250,000 and 350,000 homeless individuals in 1984. 
These methods included estimates from local studies, 
500 key informant interviews in 60 metropolitan areas, 
surveys of 184 shelter operators in 60 metropolitan 
areas, and estimates of ratios of shelter and street 
populations (U.S. Department of Housing and Urban 
Development, 1984). An Urban Institute survey used a 
probability sample of service-using homeless individu- 
als and estimated a total of 567,000—600,000 in 1987 
(Martha R. Burt and Barbara E. Cohen, 1988; Burt, 
1988). A study by ICF, Inc., based on the HUD data 
but differentiating between the metropolitan, subur- 
ban, and rural incidence of homelessness, estimated a 
homeless population of 355,000 in 1984 (Committee for 
Food and Shelter, 1987). Richard B. Freeman and 
Brian Hall (1987) derived an estimate of 279,000 in 
1983, based on a New York City sample of the shelter 
population and estimates of the shelter-to-street ratio. 

7See, for example, the surveys by Frederic G. 
Robinson (1985) for Washington, DC, Mark LaGory et 
al. (1989) for Birmingham, AL, and Franklin J. James 
(1989) for Denver. The one exception to this general- 
ization is Peter H. Rossi (1989), who found consider- 
ably fewer homeless in Chicago than the HUD esti- 
mate. 
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|—ExtTeNtT oF Poor HousinGc OuTCOMES 
IN VARIOUS CiTIEs (RANKED ACCORDING 
rO EXTENT OF HOMELESSNESS) 


Homelessness Crowding Doubling-up 
per 100,000 per 100,000 per 100,000 
City population households households 


San Francisco, CA 535.1 
Los Angeles, CA 412.0 
Miami, FI 348.8 
New York, NY 346.2 
Chicago, II 323.9 
Worcester, MA 259.9 
Fort Wayne, IN 208.1 
Las Vegas, NV 205.0 
Houston, TX 200.7 
Seattle, WA 187.6 
Detroit, MI 173.8 
Reno, NV 147.7 
Richmond, VA 147.6 
Portland, OR 136.4 
Hartford, CT 112.2 
Little Rock, AR 103.5 
Davenport, IA 94.5 
Boston, MA 86.6 
Tampa, FI 2,566.3 1,202.2 
Philadelphia, PA 2,499.7 2,287.6 
Lincoln, NEI 830.8 451.0 
Birmingham, Al 4,195.1 1,733.3 
Phoenix 4,609.0 1,730.6 
Cincinnati, A 3,227.9 1,975.2 
Columbi: ae 1,652.5 381.4 
Louisville, KY 2,903.2 2,055.2 
Danville, VA 3,784.6 1,649.7 
Syracuse, NY 1,722.6 1,226.6 
Grand Rapids, MI 1,857.9 947.0 
Salt Lake City, UT 4,388.1 1,105.3 
Sioux City, IA 2,323.2 454.6 
Monroe, LA 5,136.1 1,571.5 
Minneapolis 

St. Paul, MN 
Raleigh / Durham, NC 
Pittsburgh, PA 
Pueblo, CO 
Jackson, MI 
Baton Rouge, LA 
Dayton, OH 
Athens, GA 
Baltimore, MD 
Tyler, TX 
Colorado Springs, CO 
Charlotte, NC 
Kansas City, MO 
Cleveland, OH 
Binghamton, NY 
Charleston, SC 
Rochester, NY 
Fall River, MA 


4,300.6 
10,106.6 
11,491.2 

6,389.6 1,673.9 

4,639.2 1,792.5 

1,735.4 938.8 

2,051.7 1,520.4 

4,383.8 1,688.7 

6,265.5 1,803.5 

1,833.9 1,011.7 

3,043.8 1,997.1 

2,901.8 1,165.7 

2,224.4 1,690.5 

1,862.0 866.5 

2,556.2 1,455.4 

3,744.9 1,892.2 

1,982.0 1,031.3 

2,429.8 1,159.5 


1,259.9 
1,803.1 
1,978.5 


1,623.8 527.7 
2,855.3 1,304.7 
1,964.8 1,539.2 
3,258.1 1,712.6 
1,769.9 919.0 
5,106.5 1,262.2 
1,770.2 1,442.6 
2,874.0 847.5 
2,853.2 2,197.8 
4,410.6 1,731.2 
2,090.4 939.9 
3,625.3 1,858.7 
2,092.8 1,446.2 
1,791.2 1,483.0 
1,305.8 1,282.1 
3,937.5 1,778.2 
1,255.1 1,116.5 
3,125.0 1,250.0 
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measures of crowding and doubling-up per- 
tain to all households in the 5-percent pub- 
lic-use sample of the 1980 Census. We use 
the standard definitions of “crowded” (more 
than one person per room) and “doubled- 
up” (households containing more than one 
nuclear family, defined as parents and their 
children related by blood or adoption) 
(Michael A. Stegman, 1988). 

Table 1 indicates the numbers of home- 
less individuals (per 100,000 population) 
as well as the numbers of crowded and 
doubled-up households (per 100,000 house- 
holds) for 50 of the 60 metropolitan areas 
surveyed by HUD; The incidence of home- 
lessness in these cities ranges from a high of 
535.1 per 100,000 in San Francisco to 6.8 in 
Fall River, MA. Unfortunately, the HUD 
data do not distinguish between the two 
major components of the homeless popula- 
tion: single males and members of families 
headed by females. There is evidence from 
surveys of homeless populations in various 
metropolitan areas that the proportion of 
families varies considerably across cities 
(U.S. Conference of Mayors, 1987), but we 
are unable to capture these differences in 
this study. We include, however, indepen- 
dent variables that might be related to either 
or both of these two types of homelessness. 


II. Results 


Table 2 presents ordinary least-squares 
estimations of the incidence of homeless- 
ness, crowding, and doubling-up (per 
100,000 population or households). The final 
column contains the population-weighted 
means and standard deviations of each vari- 
able. 


We experimented with a number of mea- 
sures to capture the characteristics of the 


’We have combined Raleigh and Durham, NC, into 
a single observation to be consistent with the Census 
treatment of these two cities, and we have been forced 
to exclude Scranton, PA, Annapolis, MD, Bowling 
Green, KY, Hazelton, PA, Lewiston, ME, Lompoc, 
CA, Merced, CA, and Monroe, MI, because data on 
some or all of the independent variables were not 
available. Washington, DC, was also excluded because 
statewide variables do not exist. 
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low-rent housing market.’ Rents were mea- 
sured at the mean and at various points in 
the distribution for all apartments and for 
apartments of various sizes. We also used 
vacancy rates that corresponded to each 
rent level, as well as the fraction of housing 
in rental units, the growth and age of the 
housing stock, and the presence of rent 
control.'° Among these, the level of rents at 
the tenth percentile of the rental distribu- 
tion for all apartments had the greatest 
impact on the incidence of homelessness, 
higher rents being associated with higher 
rates of homelessness. An increase of one 
standard deviation in rents at the tenth per- 
centile resulted in a predicted increase in 
homelessness of 78 persons per 100,000 
population, or about 42 percent of the mean 
level of homelessness across the metropoli- 
tan areas in our sample. Since housing is a 
highly competitive market where rents 
should reflect average costs, this suggests 
that public policies to reduce the cost of 
providing minimally adequate housing could 
have a powerful impact in reducing home- 
lessness. Among the obvious candidates for 
examination are building and zoning codes, 
tax assessment policies, and capital costs. 
Alternatively, increased rent subsidies could 
have the same impact, although with obvi- 
ous implications for tax burdens. 

Rents at the tenth percentile also af- 
fected the extent of crowding, but they were 
not significantly related to doubling-up. The 
corresponding vacancy rate (the fraction of 
apartments at or below the tenth percentile 
currently available for rent) appeared to be 
related to homelessness, although the esti- 
mated coefficient is not statistically signifi- 
cant at conventional levels. Vacancy rates 


°We assume that the alternative for individuals or 
families at risk of being homeless, or for those leaving 
crowded or doubled-up housing, is to rent rather than 
to purchasing housing. 

Most of the measures used are for the metropoli- 
tan area (SMSA) that contains the city for which HUD 
obtained data. Our ability to predict homelessness may 
be hampered by any cross-city variation in the relation- 
ship between the central city and the metropolitan 
area. We assume that labor and housing markets are 
metropolitan in scope but, for reasons of convenience 
and services, most homeless will locate in the core city 
of a metropolitan area. 
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TABLE 2—EsTIMATED CAUSES OF HOMELESSNESS, CROWDING, AND DousBLING-Up 


Independent variable 


Dependent variable 


Homeless Crowded Doubled-up Mean [SD] 


Rents at 10th percentile of 2.87 53.40 3.07 135.30 
all apartments (3.93) (4.33) (1.19) [27.26] 
Vacancy rate at 10th percentile — 872.90 952.58 3,141.06 0.059 
of all apartments (1.58) (0.10) (1.62) [0.024] 
Presence of rent-control law — 15.50 — 224.78 — 297.56 0.306 
(0.23) (0.20) (1.26) [0.461] 
Growth in employment, 1980-1982 —859.09 2,910.76 —1,302.90 — 0.016 
(2.71) (0.55) (1.17) [0.051] 
Share of employment in — 347.69 —9,365.56 1,272.58 0.272 
service industries (1.33) (2.12) (1.38) [0.044] 
Predicted size of low-skill — 1,003.87 87,087.77 — 775.71 0.417 
labor market (0.38) (1.98) (0.08) [0.007] 
Households below poverty line 0.013 0.72 0.024 11,074.50 
per 100,000 households (1.22) (4.03) (0.64) [2,287.28] 
Local government expenditures on 0.11 — §.21 —1.12 98.00 
public welfare per capita (0.59) (1.49) (1.54) [136.71] 
Maximum AFDC benefits, — 0.95 — 16.39 0.74 518.44 
family of three (2.58) (2.64) (0.57) [86.22] 
Maximum SSI benefits 1.07 24.00 3.60 359.98 
(2.14) (2.83) (2.04) [56.64] 
Percentage reduction in 146.62 1,628.93 —312.25 .208 
AFDC benefits if living (1.49) (0.98) (0.90) [0.242] 
with nonpoor parents 
AFDC accuracy rate, 1983 98.15 40,168.43 —2,859.33 0.932 
(0.13) (3.19) (1.09) [0.025] 
Mental health in-patients per — 0.83 — 0.03 2.63 102.37 
100,000 state population (1.50) (0.003) (1.35) [40.71] 
Fraction of births to — 1,173.00 14,645.25 9,633.92 0.128 
teenage mothers (1.39) (1.03) (3.20) [0.023] 
Blacks per 100,000 0.004 — 0.038 0.024 16,551.60 
population (1.78) (1.01) (3.03) [8,212.83] 
1984 population (100,000’s) 1.22 49.90 2.54 38.14 
(1.44) (3.50) (0.85) [28.11] 
Constant 307.54 —81,311.37 — 25.25 
Adjusted R?: 0.83 0.87 0.73 


Note: Numbers in parentheses are f¢ statistics. 
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did not seem to affect the incidence of 
crowding: moreover, they were positively 
correlated with doubling-up.'' Rent control, 
which has been cited as a cause of home- 
lessness (Tucker, 1987),'* had no effect on 
either homelessness or crowding but may 
have been associated with decreased dou- 
bling-up, although the effect is imprecisely 
estimated. Rents and vacancy rates at other 
points in the distribution and for apart- 
ments of various sizes had similar but less 
significant impacts on the housing out- 
comes. Other than these measures, no 
characteristic of housing markets had any 
observable effect on any of the outcome 
measures. 

Turning to local labor markets, recent 
growth in private-sector employment 
showed a strong negative relationship to 
homelessness. It may also have reduced the 
incidence of doubling-up, although here the 
relationship is not precisely estimated. The 
relative size of the service sector was less 
strongly related to homelessness but signif- 
icantly reduced crowding. In contrast, there 
was a positive, although not significant, rela- 
tionship between this aspect of the local 
labor market and the extent of doubling- 
up.'? The relative demand for low-skill 
labor '* had little impact on homelessness or 


"Unlike crowding, doubling-up occurs throughout 
the housing market. The two functions differ consider- 
ably as a consequence. Nevertheless, we report results 
for doubling-up, since this phenomenon is often 
claimed to be related to homelessness. 

“Our results for this variable are similar to those 
found by Quigley (1990) when he added price and 
income measures to Tucker’s analysis of homelessness. 

“In addition to low-skill service industries, this sec- 
tor includes such high-wage industries as legal, medical 
(other than hospitals), and business services. 

'*This variable is defined as the predicted skill dis- 
tribution in the metropolitan area. Specifically, it is the 
fraction of an area’s workers who ‘vould have less than 
a high-school education if workers in each occupation 
in the area had the educational distribution that pre- 
vailed for those occupations in the nation as a whole. 
The actual skill composition of the labor market may 
be a flawed measure of the demand for low-skilled 
labor, since the observed fraction of workers with less 
than a high-school degree is likely to be strongly influ- 
enced by the pool of workers available in a local labor 
market. 
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doubling-up but was positively associated 
with crowding rates. This suggests that in 
areas where a large fraction of the available 
jobs require low skill levels (and therefore 
presumably pay low wages), there is result- 
ing pressure on the ability of workers to 
afford more than minimal-quality housing. 
In combination with the negative relation- 
ship between employment growth and 
homelessness, the absence of a relationship 
between the relative skill levels of an area’s 
jobs and homelessness may suggest that even 
low-skill jobs provide sufficient income to 
avoid homelessness. This pattern of results 
suggests that local areas concerned with re- 
ducing homelessness may want to adopt 
strong pro-employment growth policies 
without regard to the type of employer be- 
ing attracted. 

Interestingly, the unemployment rate 
(overall or specific to the low-skill popula- 
tion), the long-term unemployment rate 
(unemployment of more than three months), 
and the ratio of employment to population, 
which has been found to be an important 
factor in youth labor markets (Freeman, 
1982), did not have observable impacts on 
homelessness. 

Although there was some suggestion that 
household incomes, as measured by the pro- 
portion of households in the metropolitan 
area with incomes below the poverty line, 
were related to homelessness, incomes were 
a much stronger predictor of crowding. 
However, there did not appear to be any 
relationship between incomes and the inci- 
dence of doubling-up. This finding is con- 
sistent with shared housing resulting from 
numerous personal and other factors in ad- 
dition to those that create homelessness. 

Government expenditures on social ser- 
vices would be expected to be related to 
conditions at the lower end of the housing 
market, and we experimented with a num- 


ber of alternative measures.'!° While the 


"Both poverty rates and government expenditures 
on public welfare are likely to be at least partially 
endogenous, determined as a function of other in- 
cluded factors such as labor-market conditions and 
program benefit levels. 
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overall level of expenditures by localities on 
public welfare reduced the incidence of both 
crowding and doubling-up (although the co- 
efficients were not significant at the 10- 
percent level), only program-specific bene- 
fits had any impact on homelessness.'® 
Higher maximum Aid to Families with De- 
pendent Children (AFDC) benefits were as- 
sociated with lower rates of homelessness 
(as well as with a lower incidence of crowd- 
ing). Supplemental Security Income (SSI) 
benefits, on the other hand, were associated 
with higher rates of not only homelessness 
but also crowding and doubling-up. These 
results are puzzling. It is possible that they 
may be explained by differences in the pop- 
ulations served by these two programs. Cer- 
tainly the relationship between the SSI pro- 
gram and homelessness presents a fruitful 
area for future research. 

We also included some program charac- 
teristics as proxies for the administrative 
stringency of public-assistance programs.'” 
For example, the larger the statewide re- 
duction in AFDC benefits for single moth- 
ers residing with nonpoor parents, the 
higher was the incidence of homelessness in 
the locality, although the coefficient was not 
statistically significant at the 10-percent 
level. Crowding and doubling-up, however, 
were, at most, weakly affected by this aspect 
of the AFDC program. Another measure, 
the statewide proportion of AFDC families 
that, according to a Department of Health 
and Human Services audit of state pro- 
grams, were actually eligible for the benefits 
they were receiving (the AFDC accuracy 


'6Homeless families are likely to be eligible for 
AFDC benefits; homeless single individuals may be 
eligible for Social Security Disability Insurance (SSDI), 
a Federal program requiring previous employment, or 
SSI (a means-tested, combined federal and state pro- 
gram). SSDI benefits were not included, since benefit 
levels do not vary by locality. 

"There is evidence for New York City, for example, 
that the majority of homeless families have at some 
time been AFDC recipients and that many had been 
dropped from the program for “administrative” rea- 
sons (e.g., for missing appointments or for failing to 
adhere to administrative requirements) (Anna Lou De- 
havenon, 1989; James R. Knickman and Beth C. 
Weitzman, 1989). 
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rate) did not seem to influence homeless- 
ness. However, higher accuracy rates, which 
imply that fewer families were receiving 
AFDC benefits for which they were not 
eligible, were associated with higher rates of 
crowding.'® 

Treatment of the mentally ill, as mea- 
sured by the number of inpatients in state 
mental-health facilities! appears to have 
had an impact on the incidence of home- 
lessness, although this effect is imprecisely 
measured. There has been heated contro- 
versy regarding the effect on homelessness 
of the decision in the 1960’s to attempt 
treatment of the mentally ill in community 
centers rather than in state mental hospi- 
tals.°? These findings suggest that the policy 
of not institutionalizing the mentally ill has 
been an important factor in increasing 
homelessness. While the incidence of 
crowding does not appear to be affected by 
hospitalization rates, we observe a positive 
(but again imprecisely measured) relation- 
ship with the extent of doubling-up. 

Since we are unable to control for the 
number of mentally ill in each state, these 
coefficients require interpretation. The 
mentally ill individual faces three possible 
living situations: in a state hospital, on the 
street (homelessness), or being taken in by 
relatives (doubling-up). If the rate of mental 
illness were constant across metropolitan 
areas, the coefficients on hospitalization 
rates in the homeless and doubling-up 
equations should sum to —1 and would 
represent variations in how the mentally 
ill were allocated across living situations. 
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An additional measure, the number of SSI and 
SSDI recipients dropped from the rolls in the early 
1980's, which has often been cited as a cause of home- 
lessness, was not significant in any of the equations. 

’Facilities are restricted to hospitals and do not 
include community care establishments. This measure 
was more significant than per capita state spending on 
mental health or a flow measure of the number of new 
admissions to mental-health facilities. 

“See the discussions in W. Robert Curtis (1986) 
and Redburn and Buss (1986). In 1955, there were 
559,000 patients in state mental hospitals; currently, 
there are approximately 123,000 (John A. Talbott, 
1989). 
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Conversely, if hospitalization rates were 
constant but the rate of mental illness var- 
ied across areas, there would be positive 
coefficients on the number of hospitalized 
patients in equations predicting the extent 
of homelessness and doubling-up. The mag- 
nitude of these coefficients would be equal 
to the ratios of homeless or doubled-up 
individuals among the mentally ill to those 
hospitalized. Since it is likely that both 
mental illness and hospitalization rates vary 
across states, the observed coefficients com- 
bine the impact of variations in both rates. 
This means that the negative coefficient on 
the hospitalization rate in the homelessness 
equation understates the true effect of hos- 
pitalization policies by any positive influ- 
ence from variations in the extent of mental 
illness. The positive relationship between 
hospitalization and doubling-up suggests 
that many more mentally ill individuals are 
cared for by friends and relatives than by 
state institutions. 

We also included population characteris- 
tics that might affect the low-rent housing 
market even after controlling for housing 
markets, job opportunities, and incomes. 
The percentage of births to teenage moth- 
ers, for example, substantially increased the 
incidence of doubling-up but, surprisingly, 
appears to be related to lower levels of 
homelessness. The larger the relative size of 
the black population, the higher were the 
rates of homelessness and, especially, dou- 
bling-up.”! Finally, the incidence of crowded 
housing, and possibly homelessness, was 
greater in larger metropolitan areas.” 


*lThe remaining ethnic groups, Hispanics and 
Asians, are excluded because their density is heavily 
influenced by locational decisions of recent immi- 
grants. These decisions are a function of labor markets, 
housing markets, and government policy, which are 
already in our equations. The exogenous variables in- 
cluded in the model explain 92 percent of the variation 
in the relative size of the Asian population in 
metropolitan areas, for example. 

A final factor, climate, which has often been cited 
as a cause of homelessness, was unrelated to homeless- 
ness. 
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III. Conclusion 


Our key finding is that, despite the per- 
ceived weakness of the HUD data and the 
large variation in the incidence of homeless- 
ness and other poor housing outcomes 
across metropolitan areas,” a relatively par- 
simonious and intuitively appealing set of 
factors is able to explain variations in home- 
lessness, crowding, and doubling-up across 
American cities. The findings indicate that 
homelessness has multiple causes. It has 
roots in housing markets, labor markets, 
and public policies regarding the treatment 
of the mentally ill and the low-income popu- 
lation. Although replication of this study 
using data from the 1990 Census should be 
fruitful, the results reported above are sug- 
gestive and provide a framework for discus- 
sions of policies to reduce homelessness. 


REFERENCES 


Applebaum, Richard P., Counting the Home- 
less, Chicago: Center for Urban Research 
and Policy Studies, University of Chicago, 
February 1987. 

Borsch-Supan, Axel and Pitkin, John, “On Dis- 
crete Choice Models of Housing De- 
mand,” Journal of Urban Economics, 
September 1988, 24, 153-72. 

Burt, Martha R., “Comments,” Rethinking 
Policy on Homelessness, Washington, DC: 
Heritage Foundation, 1988. 

and Cohen, Barbara E., Feeding the 
Homeless: Does the Prepared Meals Provi- 
sion Help? Vols. 1 and II, Washington, 
DC: Urban Institute, 1988. 

Curtis, W. Robert, ““The Deinstitutionaliza- 
tion Story,” Public Interest, Fall 1986, 85, 
34-49. 

Dehavenon, Anna Lou, “The Tyranny of In- 


*3The mean incidence of homelessness in the 1984 
HUD data was 187.5 per 100,000 population with a 
standard deviation of 147.8. On average, 4,422.5 out of 
every 100,000 households in the cities in our sample 
were crowded (standard deviation of 2,762.2), and 
1,617.3 were doubled-up (standard deviation of 409.8). 


3 


VOL. 83 NO. 1 


difference: A Study of Hunger, Home- 
lessness, Poor Health and Family Dis- 
memberment in 818 New York City 
Households with Children in 1988-89,” 
mimeo, East Harlem Interfaith Welfare 
Committee, New York, 1989. 

Filer, Randall K. and Honig, Marjorie, Policy 
Issues in Homelessness: Current Under- 
standing and Directions for Research, New 
York: Manhattan Institute for Policy Re- 
search, 1990. 

Freeman, Richard B., “Economic Determi- 
nants of Geographical and Individual 
Variations in the Labor Market Position 
of Young Persons,” in Richard B. Free- 
man and David A. Wise, eds., The Youth 
Labor Market Problem: Its Nature, Causes, 
and Consequences, Chicago: University of 
Chicago Press, 1982, pp. 115-54. 

and Hall, Brian, “Permanent Home- 
lessness in America?” Population Re- 
search and Policy Review, January 1987, 6, 
3-27. 

James, Franklin J., “Factors Which Shape 
the Risks of Homelessness: Preliminary 
Observations From Colorado,” mimeo, 
Denver: University of Colorado Graduate 
School of Public Affairs, September 1989. 

Knickman, James R. and Weitzman, Beth C., A 
Study of Homeless Families in New York 
City: Risk Assessment Models and Strate- 
gies for Prevention, New York: Health Re- 
search Program of New York University, 
1989. 

LaGory, Mark, Ritchey, Ferris J., O’Donoghue, 
Timothy and Mullis, Jeffrey, “Homeless 
People in Alabama: A Variety of People 
and Experiences,” in J. Momeni, ed., 
Homelessness in The United States, West- 
port, CT: Greenwood, 1989. 

Parsons, L., Literature Review of Studies 
Which Deal with Estimating the Size of the 
Homeless Population, Amherst, MA: So- 
cial and Demographic Research Institute, 
University of Massachusetts, 1986. 

Quigley, John M., “Does Rent Control Cause 


HONIG AND FILER: VARIATION IN HOMELESSNESS 255 


Homelessness? Taking the Claim Seri- 
ously,’ Journal of Policy Analysis and 
Management, Winter 1990, 9, 89-93. 

Redburn, F. Stevens and Buss, Terry F., Re- 
sponding to America’s Homeless, New 
York: Praeger, 1986. 

Reed, Renya, ““The Determinants of Home- 
lessness Amongst Families on Welfare: 
Estimation of Three Econometric Models 
of Search in the Low Income Housing 
Market,” Ph.D. dissertation, Johns Hop- 
kins University, 1992. 

Ringheim, Karin, At Risk of Homelessness: 
The Roles of Income and Rent, New York: 
Praeger, 1990. 

Robinson, Frederic G., ““Homeless People in 
the Nation’s Capital,” mimeo, University 
of the District of Columbia, 1985. 

Rossi, Peter H., Down and Out in America: 
The Origins of Homelessness, Chicago: 
University of Chicago Press, 1989. 

Stegman, Michael A., Housing and Vacancy 
Report: New York City, 1987, City of New 
York: Department of Housing Preserva- 
tion and Development, April 1988. 

Talbott, John A., “Testimony before U.S. 
Senate Subcommittee on Housing and 
Urban Affairs,” Access (Washington, DC: 
National Resource Center on Homeless- 
ness and Mental Illness), 1989, /, 3. 

Tucker, William, ““Where Do the Homeless 
Come From?” National Review, 25 
September 1987, 39, 32-43. 

Committee for Food and Shelter, Characteris- 
tics and Housing Needs of the Homeless, 
Washington, DC: ICF, Inc., 1987. 

U.S. Conference of Mayors, The Continuing 
Growth of Hunger, Homelessness and 
Poverty in America’s Cities: 1987, Wash- 
ington, DC: U.S. Conference of Mayors, 
January 1987. 

U.S. Department of Housing and Urban Devel- 
opment, A Report to the Secretary on the 
Homeless and Emergency Shelters, Wash- 
ington, DC: Office of Policy Development 
and Research, May 1984. 


Cyclical Consumption Patterns and Rational Addiction 


By ENGELBERT J. DoCKNER AND GUSTAV FEICHTINGER* 


Much of empirically observable consump- 
tion behavior seems to contradict rational- 
choice theory. Here we refer to heavy eating 
followed by strict dieting; smoking, quitting 
and starting again; taking drugs, quitting, 
and starting all over again; and so on. In a 
recent paper, Gary S. Becker and Kevin M. 
Murphy (1988) demonstrate that a wide 
variety of consumer behavior—addiction 
included—is consistent with utility maxi- 
mization. As a consequence, addictive be- 
havior need not be excluded from the 
rational-choice framework. As Becker and 
Murphy (1988) point out, an even stronger 
claim holds true: analyzing addictive behav- 
ior within the rational-choice framework 
provides new insight and a better economic 
understanding of addiction. 

Explaining addictive behavior in terms of 
utility-maximizing consumers does not im- 
ply, however, that addicts can be considered 
as happy persons. What this framework does 
is to demonstrate that for a variety of rea- 
sons certain consumers can become ad- 
dicted; once they are addicted, to stop con- 
suming the addictive good would make them 
worse-off. 

A consumer is said to be addicted to a 
consumption good c if an increase in past 
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consumption of c causes present consump- 
tion to rise. In the theory of rational addic- 
tion (see George J. Stigler and Becker, 1977; 
Laurence R. Iannaccone, 1986; Becker and 
Murphy, 1988; Daniel Leonard, 1989) past 
consumption is summarized by a stock of 
consumption capital (habits) that, together 
with current consumption, affects current 
utility. Hence, a consumer can be called 
addicted to c if consumption of c increases 
with an increase in the stock of its corre- 
sponding consumption capital. This defini- 
tion implicitly assumes that c accumulates a 
single stock (consumption capital). We call 
this stock commodity-specific consumption 
capital. It is this case that is extensively 
discussed in the literature on rational addic- 
tion. If, on the contrary, a consumption 
good accumulates several stocks of con- 
sumption capital, past consumption affects 
current utility through several channels. As 
one might expect, this requires a separate 
analysis of addiction. It is at this point that 
the present note steps in. We study a model 
of addiction in which a single consumption 
good accumulates two stocks. This is the 
simplest case without commodity-specific 
consumption capital, but it provides all the 
economic intuition for the more general 
case. 

One important consequence of the relax- 
ation of commodity-specific consumption 
capital is that the variety of possible con- 
sumption profiles over time is enlarged. This 
motivates us to explore the relationship be- 
tween various degrees of addiction and the 
consumption profile over a person’s life- 
time. In the case of commodity-specific con- 
sumption capital, Becker and Murphy (1988) 
discuss several possible outcomes: they 
show, for example, that in the case of very 
strong addiction rational consumers have to 
go cold turkey (i.e., stop consuming the good 
abruptly). In other cases with lower degrees 
of addiction, consumption follows a mono- 
tonic path. This holds true even for the case 
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of an unstable steady state. However, does 
this imply that cyclical consumption policies 
cannot be explained by the theory of ratio- 
nal addiction? Certainly not, as has already 
been demonstrated by Becker and Murphy 
(1988). Our note elaborates on their analy- 
sis, and we fully explain the causes for 
the occurrence of cyclical consumption tra- 
jectories. We show that, for the case of 
commodity-specific consumption capital, 
consumption trajectories will always be 
monotonic. Hence, cyclical consumption 
paths expressed as stable limit cycles or 
damped (explosive) oscillations require a 
consumption good that accumulates at least 
two stocks. It is the interaction of these two 
stocks that causes irregular behavior. In 
particular, we show that only if present con- 
sumption is positively correlated with past 
consumption as expressed in one of the two 
stocks (say, a stock called eating capital) but 
is negatively correlated with the other stock 
(say, one called weight) cyclical consump- 
tion patterns are possible. This implies that 
consumption cycles require two counterbal- 
ancing effects: an addictive one and a satiat- 
ing one. The addictive force causes current 
consumption to increase as past consump- 
tion accumulates (hence, the ascending part 
of the cycle); the satiating force causes cur- 
rent consumption to decline as habits accu- 
mulate (hence, the descending part of the 
cycle). 

As mentioned above, cyclical consump- 
tion profiles can be expressed both in terms 
of damped or explosive oscillations as well 
as in terms of stable limit cycles. In the first 
case, the trajectory either converges to or 
diverges from the steady-state consumption 
capitals (i.e., the long-run solutions). In the 
latter case the long-run equilibrium is not a 
single point, but rather an invariant mani- 
fold (i.e., the limit cycle). Economically, the 
existence of a stable limit cycle implies that 
binges can continue to cycle throughout 
much of a person’s lifetime. This brings us 
back to the examples discussed at the very 
beginning of this paper and explains why 
certain consumers might follow binges 
throughout much of their lifetimes. 

In Section I, we introduce a model of 
rational addiction that is a variation of the 
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two-state-variable example of Becker and 
Murphy (1988). Here we concentrate only 
on what might be called the semi-reduced- 


form model that involves the allocation of 


consumption goods over time and the corre- 
sponding accumulation of two stocks of 
consumption capital. In Dockner and 
Feichtinger (1989) we demonstrate that this 
model can be derived from a more general 
one that includes the intertemporal choice 
of leisure time and goods allocation subject 
to a wealth constraint. In Section II, we 
discuss the relationships among comple- 
mentarity over time, addiction, and satia- 
tion and use these results to give precise 
information about monotonic or cyclical 
consumption profiles. In Section III], we 
present a numerical example with strictly 
concave preferences that leads to a stable 
limit cycle as optimal policy. Section IV 
concludes this note. 


I. The Model 


We consider a representative consumer 
who at each instant of time ¢ derives utility 
from consumption of goods c(t) and c,(t) 
and stocks of consumption capital S$,(t) and 
S(t), as in 


(1) U(t)=U(c,(t),c,(t), S(t), S,(t)). 


We assume that the utility function U(c,, 
c,,5,,8,) is strictly concave jointly in (c,, 
c,, §,, Moreover, the cross partial 
derivative Us,s, is set equal to zero (ie., 
marginal utility with respect to consumption 
capital S, is independent of the level of S,, 
L#)). 

The stocks (S,(t),S$,(7)) are measures of 
past consumption of (c,(t),c,(t)) that affect 
current utility through an accumulation pro- 
cess; that is, 


(2) = — 


(3) S,(t) = 


Relations (2) and (3) are neoclassical accu- 
mulation equations with 6; as constant de- 
preciation rates, and the functions f; are 


= 
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specified depending on whether the con- 
sumption capitals are commodity-specific or 
not. Throughout the paper we assume that 
5, > 6). 

Assuming an_ infinitely-lived consumer 
with a constant rate of time preference, r, 
who maximizes the discounted stream of 
utility subject to the accumulation equations 
(2) and (3), the dynamic household problem 
becomes 


4 max So(t)) dt 
I 2 
(1), 70 


subject to (2) and (3) and given initial condi- 
tions $,(0) = Sj, > 0. 

Model (2)-(4) describes the optimal allo- 
cation of the consumption good(s) over time 
when utility depends on both current and 
past levels of consumption. It can be de- 
rived from a more general model that in- 
cludes the intertemporal choice of leisure as 
well as a wealth constraint (see Dockner 
and Feichtinger [1989] for details). 

The general specification (2)-(4) allows 
for two important examples as special cases: 
one with commodity-specific consumption 
capital and one without it. Adopting the 
definition of Iannaccone (1986), we call the 
consumption capitals (habits) commodity- 
specific if for each S; there exists a unique 
commodity c;, such that f; and U. /U, 
depend only on c,; and S,. In our case with 
two consumption capitals this implies that 
each consumption good accumulates a sin- 
gle stock, f(c,(t),c(t)) = ct), and that the 
marginal rate of substitution between cur- 
rent and past consumption is a function of 
c; and §S;. In order for the latter condition 
to be satisfied together with Us; 5, =0, we 
restrict attention to the additively separable 
case U=U,+U,. To sum up, the case of 
commodity-specific consumption capital is 
represented by the accumulation equations 
SAt) = 6;St) and the utility function 
U=U((c,,S,)+ U(c,, S,). 

In the case with no commodity-specific 
consumption capital we restrict attention to 
the simplest case possible and assume that a 
single good, c(t), accumulates two distinct 
consumption capitals—eating capital (S,) 
and weight ($,)—and that preferences are 
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of the general form represented by (1). Al- 
gebraically this amounts to 


(5) S\(t) = c(t) 6,S,(t) 
S,(t) = c(t) — 8,S,(t) 


with preferences given by U(c, S,, S,). 

The optimal paths of c(t) (the case of no 
commodity-specific consumption capital) or 
(c,(t),c(t)) (commodity-specific consump- 
tion capital) and (S,(t),S,(t)) are deter- 
mined by the first-order conditions 


(6) cor(c,,c) 


= argmax{U + + A,S>} 


(7) A, 


(8) A= (1 + — Us, 


and the transversality condition 
(9) im a(t) [S\(4) 


8,(2)]} = 0 


for all feasible states $,(-),5,(-). In the 
above equations, A; are the shadow prices 
of the consumption stocks S;. They measure 
the value of the future benefits or costs of 
consumption. Through the price system 
(A,,A,), rational consumers take into ac- 
count the future consequences of current 
actions. Depending on whether consump- 
tion capital has beneficial or harmful ef- 


fects, we expect the prices to be positive or 
negative. 


II. Addiction and Cyclical Consumption Paths 


A commodity, c, is called addictive if its 
current consumption increases as habits de- 
rived from its previous consumption accu- 


2 
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mulate. Hence, in the case of commodity- 
specific consumption capital a consumer 
displays addictive behavior if c,(t) increases 
with an increase in its corresponding S,(t). 
This occurs if preferences are characterized 
by adjacent complementarity .' 

The case of non-commodity-specific habits 
requires a separate treatment. First, we have 
to present a formal definition of addiction. 
Second, we need to explore whether the 
relationship between addiction and adjacent 
complementarity carries over to this case. 
This is‘not a trivial issue because past con- 
sumption affects current utility and hence 
current consumption through several chan- 
nels, and therefore the interaction of the 
stocks will be of importance. In our example 
it is the current level of both eating capital 
and weight that influence c(t). 

To shed some light on these open ques- 
tions we concentrate on the optimal path 
for the simplest case with nonspecificity as 
summarized through (5). Introducing a 
transformation of variables such that the 
steady-state capital stocks and the steady- 
state level of the single consumption good 
are zero and allowing for linearizations, the 
dynamics of the optimal solution is gov- 
erned by 


(10) S\(t) = aye"! + 


(11) 


S,(t) = a,,e"" + a,,e"?" 


where w, and yp, are the stable roots (or 
the two smaller roots in the case of instabil- 
ity) of the linearized modified Hamiltonian 


The relationship between addiction and adjacent 
complementarity in the case of commodity-specific con- 
sumption capital first appeared explicitly in Marcel 
Boyer (1983) and was discussed at length by Iannac- 
cone (1986) and Becker and Murphy (1988). It ap- 
peared implicitly but was not explicitly recognized in 
Harl E. Ryder and Geoffrey M. Heal (1973). 
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system. These dynamics imply that, locally 
around the steady state, the optimal c(t) is 
a linear function of both S(t) and S,(t); 
that is, 


(12) c(t)=aS,(t)+ BS,(t) 


5,— 6, 


5, 65 


S,(t) 


(see the Appendix for derivations). This re- 
lationship allows for a general discussion of 
addictive and other behavior in the case of 
non-commodity-specific consumption capi- 
tal. We call a consumer fully addicted to c 
if both stocks of consumption capital are 
positively correlated to c (i.e., if a and B 
are positive). In terms of our example, a 
consumer is fully addicted to c if both an 
increase in eating capital S, and an increase 
in weight §, cause him to consume more of 
c. This behavior will lead to a monotonic 
consumption profile and is consistent with 
convergence to the steady state (stability). A 
consumer need not be fully addicted to c, 
but instead may display addictive and satiat- 
ing behavior at the same time. We call this 
behavior partially addicted. This occurs if 
one of the stocks is positively correlated 
with c and the other one is negatively corre- 
lated with c. As we will see, this behavior 
most likely results in consumption cycles. 
Before discussing cyclical consumption pat- 
terns, however, let us answer the question 
of what causes a consumer to become fully 
addicted if habits are not commodity- 
specific. The next proposition provides an 
answer to this question. 


PROPOSITION 1: If consumption capitals 
S, and S, are not commodity-specific, then a 
consumer is fully addicted to c (i.e., a and B 
are both positive) if and only if his behavior 
displays adjacent complementarity with re- 
spect to both stocks. 


(See the Appendix for a proof.) 
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Proposition 1 generalizes the relationship 
between addiction and adjacent comple- 
mentarity to non-commodity-specific habits. 
It also reveals that addiction implies that 
past consumption as measured in S, and S, 
raises the marginal utility of current con- 
sumption (i.e., > 0). 

The proof of Proposition 1 also shows 
that addiction with respect to both habits 
implies monotonic consumption paths and 
that it is consistent with convergence to 
the steady state: as long as we have — 6, 
<p,<-—6, and 0>p,>-—6,, the con- 
sumer displays addictive behavior that re- 
sults in monotonic convergence to the steady 
state. 

As is clear from equation (12) cyclical 
consumption paths require that one of the 
stocks is positively correlated with c and 
that the other one is negatively correlated 
with c. These cycles can be expressed as 
damped /explosive oscillations or limit cy- 
cles, depending on whether the eigenvalues 
@, and pw, are complex with negative 
(damped oscillations), positive (explosive os- 
cillations), or zero (limit cycles) real parts. 
As has been demonstrated by Becker and 
Murphy (1988) the roots w, and p, are 
conjugate complex if preferences display ad- 
jacent complementarity with respect to the 
stock with the higher depreciation rate and 
distant complementarity otherwise. In any 
case, conjugate complex roots imply that 
consumption is positively correlated with S,, 
the stock with the higher depreciation rate, 
and negatively correlated with S,. In light 
of equation (12), this allows for an interest- 
ing economic interpretation. Cyclical con- 
sumption patterns require two counterbal- 
ancing forces: an addictive force that causes 
current consumption to rise as past con- 
sumption accumulates and a satiating force 
that causes it to fall. The higher deprecia- 
tion rate of the addictive stock then gener- 
ates periods with increasing as well as de- 
creasing consumption. 

This discussion summarizes the existence 
of cyclical consumption paths in a model 
without commodity-specific consumption 
capital. What remains to be shown is 
whether consumption cycles are possible 
with commodity-specific habits. 
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PROPOSITION 2: Jf consumption capital 
is commodity-specific, optimal consumption 
paths are monotonic. 


(See the Appendix for a proof.) 

This is an interesting result and under- 
lines the importance of nonspecificity for 
cyclical consumption profiles. Consumption 
cycles are possible only if a single consump- 
tion good accumulates two stocks of con- 
sumption capital, one of which generates 
addictive and the other one satiating behav- 
ior. These cycles, however, are the result of 
forward-looking behavior. Only a consumer 
with a desire to eat and a dislike for weight 
who anticipates the future consequences of 
his current actions can end up in eating and 
dieting cycles. 


III. Persistent Consumption Cycles: 
A Numerical Example 


The general discussion carried out in the 
preceding section has revealed that differ- 
ent configurations for optimal policies are 
possible depending on the natv.e of the 
consumption capital (whether it is commod- 
ity-specific or not) as well as on complemen- 
tarity in consumption. In particular, we have 
seen that the theory of rational addiction is 
capable of explaining cyclical consumption 
paths expressed as damped /explosive waves 
or limit cycles. In this section we present a 
numerical example that establishes a stable 
limit cycle as optimal policy.? Our interest 
in limit cycles as optimal consumption paths 
is motivated by the fact that a stable limit 
cycle can be identified as persistent oscilla- 
tory behavior and hence is capable of ex- 
plaining binges that continue to cycle 
throughout much of a person’s lifetime. 

We have to restrict our analysis to the 
case without commodity-specific consump- 


*It should be noted that Marcel Boyer (1978) does 
mention the possibility of a limit cycle as an optimal 
consumption path but does not characterize it. Becker 
and Murphy (1988) characterize cyclical consumption 
in terms of damped /explosive waves but do not discuss 
limit cycles. 
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tion capital. We specify the utility function 
as 


(13) U(c,S1,82) = + 


+ + 552 + wes}. 


The utility function (13) is strictly concave 
jointly in (c,S,,S,) as long as <1, 
nt+e<1,0<0,v <0, €<0, and @v 
0. As the inequality for the marginal utility 
of consumption (U. > 0) is not globally satis- 
fied for (13) we restrict our numerical calcu- 
lations to the region of the (c, S,,S,)-space 
where it holds. 

For the numerical example we make use 
of the following parameter values: 


(14) 6,=0.45 6,=0.14 =0.50 


€=049 @=-2 
€é=-10 w=55. 


— 16 


To prove the existence and stability of the 
limit cycle we apply the Hopf bifurcation 
theorem (see John Guckenheimer and Philip 
Holmes [1990] for details). Among other 
things, this requires the existence of a pair 
of imaginary roots uw, and p>. As noted, 
pure imaginary roots result in a positive 
correlation between c and S, and a nega- 
tive correlation between c and S,. 


PROPOSITION 3: If the parameters are 
specified as in (14) and the time preference 
rate assumes the critical value r, = 5.8218875, 
then there exists a pair of imaginary roots 
that gives rise to the local existence of limit 
cycles. These cycles are stable and occur for 
an interval of time preference rates r slightly 
greater than ry. 


PROOF: 

A detailed numerical analysis of this re- 
sult can be found in Dockner and 
Feichtinger (1989). 


Proposition 3 provides us with the existence 
of stable limit cycles as rational intertempo- 
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FIGURE 1. OpTIMAL CONSUMPTION (c), EATING 
Capita (S$), AND WEIGHT (W) PRoFILES OVER 
TIME 


ral consumption choices. These cycles are 
generic in the sense that they do occur for 
an interval of time preference rates (param- 
eter values) (see Guckenheimer and 
Holmes, 1990). Hence, limit cycles are not 
knife-edge, as perhaps suggested by the criti- 
cal value rp. 

The cyclical trajectories corresponding to 
the stable limit cycle are depicted in Figure 
1. These paths show the following charac- 
teristics. Initially, both eating capital and 
weight are low. Given the positive correla- 
tion between S$, and c, consumption rises. 
While c increases eating capital rises more 
rapidly than weight. Eventually, consump- 
tion levels off while weight continues to 
increase. With declining consumption and a 
high rate of depreciation, eating capital de- 
clines. Dieting also causes weight to decline 
so that, after some time, both eating capital 
and weight are at their initial values, and 
the cycle starts all over again. 
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IV. Concluding Remarks 


This paper presents two new propositions 
in the theory of rational addiction. First, we 
show that the notion of commodity-specific 
consumption capital is crucial for the un- 
derstanding of cyclical consumption paths. 
Only a model of rational addiction without 
commodity-specific consumption capital is 
capable of explaining consumption cycles. 
Second, we demonstrate that consumption 
behavior may very well end up in persistent 
oscillations (i.e., a limit cycle). We derive 
this result in terms of a numerical example 
and use it to explain binges that continue to 
cycle much throughout a person’s lifetime. 


APPENDIX 


Derivation of Equation (12).—Differ- 
entiation of (10) and (11) with respect to 
time ¢ and substituting equation (5) yields 


c(t) = + 44,8, 

c(t) + 4,6, )e"" 

+ + 


Substituting (10) and (11) into c(t)=aS,(t) 
+ BS(t) yields 


c(t) =(a@a,, + Ba,,)e*" 
+ + Ba,,)e"?". 


Comparing coefficients in the above three 
equations results in 


Qa), + Bay, = 
+ Bay = + 
+ Bar, = 


+ = + 6) 
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with a unique solution 


as 
1 422 
| 
4; 42 


that provides us with the coefficients in 
equation (12). 


PROOF OF PROPOSITION 1: 
From Dockner and Feichtinger (1991), we 
know that the stable roots are given by 


=~-y |=] -~+=VK?—4det 
2°2 


r r\2 K 1 
5-y(5) —Adet J 


det J = — ¥2 Ay 


with y, = 6(6; + r) and 
A; =- U.. [(r +26;)U.5. Us.s,|- 


There is said to be adjacent complementarity 
with respect to stock S; if A;>0 and dis- 
tant complementarity if A;<0 (see Ryder 
and Heal, 1973). 

According to equation (12) and keeping 
in mind that 6,>6,, a@ and B are both 
positive if and only if +6, >0, w,+6,> 
0, and uw, + 6, <0 hold simultaneously. It is 
easily shown that this, however, is equiva- 
lent with A, >0and A,>0. 


PROOF OF PROPOSITION 2: 

Iannaccone (1986) has shown that with 
commodity-specific consumption capital the 
optimal path of S; near the steady state is 
approximated by S(t)=S* where 
S* is the steady-state level, k; is a constant, 
and py; is a stable eigenvalue given by 


Ay. 
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As is shown in Dockner and Feichtinger 
(1989) strict concavity of preferences im- 
plies (r /2+ 6,;)? — A; >0. Hence S(t) can 
never be cyclical. Noting that, in the neigh- 
borhood of the steady state, sgn(¢,) = 
sgn( A ,)sgn(S;) implies that consumption can 
never be cyclical, too. 
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The Uncertain Unit Root in Real GNP 


By GLENN D. RuDEBUSCH* 


The impulse and propagation mecha- 
nisms of business cycles have long been 
debated; however, until recently, economists 
were in fairly broad agreement that busi- 
ness fluctuations could be studied sepa- 
rately from the secular growth of the econ- 
omy. This separation was justified because, 
to a first approximation, the factors underly- 
ing trend growth were assumed to be stable 
at business-cycle frequencies. Indeed, the 
common practice of macroeconomists of all 
theoretical persuasions was to model move- 
ments in real GNP as stationary fluctuations 
around a linear deterministic trend (e.g., 
Finn Kydland and Edward C. Prescott, 1980; 
Olivier J. Blanchard, 1981). Such a trend- 
stationary (TS) model of real GNP was the 
canonical empirical representation of aggre- 
gate output until the early 1980s. 

In contrast to previous work, much of the 
research of the last ten years has assumed a 
unit root in the autoregressive representa- 
tion of real GNP, which is inconsistent with 
a TS model of output. A model with a unit 
root, commonly termed a “difference- 
stationary” (DS) model, implies that any 
stochastic shock to output contains an ele- 
ment that represents a permanent shift in 
the level of the series. If real GNP is best 
represented by a DS model, the traditional 
separation between business cycles and 
trend growth is incorrect. In the usual em- 
pirical versions of the DS model estimated 
for real GNP, output behaves more like a 
random walk than like transitory deviations 


*Board of Governors of the Federal Reserve Sys- 
tem, Division of Monetary Affairs, Washington, DC 
20551. I thank two anonymous referees for comments 
as well as Bill Bell, Frank Diebold, Steve Durlauf, 
Spence Krane, Doug Steigerwald, and David Wilcox. 
Michelle Phillips and Roberto Sella provided research 
assistance. The views expressed are my own and are 
not necessarily shared by the Board of Governors or its 
staff. 


from a steadily growing trend (e.g., John Y. 
Campbell and N. Gregory Mankiw, 1987; 
Peter K. Clark, 1987). The long-run trend of 
these DS representations is not fixed, as in 
a TS model, but stochastic; indeed, in the 
typical DS model, almost all fluctuations in 
output represent permanent shifts in trend 
rather than transitory movements in cycle. 
From another perspective, the essential dif- 
ference between the two models can be 
found in the persistence of their dynamic 
responses to random shocks. In the DS 
model of output, the effect of a shock per- 
sists forever because the disturbance 
changes the trend component and thus af- 
fects the level of output in all future peri- 
ods. In contrast, the impact of a shock in 
the TS model is transitory and is eliminated 
quite quickly as output reverts to its steady 
trend. 

The widespread acceptance of a DS model 
for aggregate output was based on evidence 
that the hypothesis of a unit root in real 
GNP could not be rejected.' Although this 
evidence was extremely robust across vari- 
ous data samples and unit-root testing pro- 
cedures (see e.g., Charles R. Nelson and 
Charles I. Plosser, 1982; René M. Stulz and 
Walter Wasserfallen, 1985; James H. Stock 
and Mark W. Watson, 1986; Pierre Perron 
and Peter C. B. Phillips, 1987; Perron, 1988; 
George W. Evans, 1989), the distinct con- 
trast between traditional TS models of out- 
put and recent DS models led many re- 
searchers to challenge the unit-root tests. In 
particular, some questioned the power of 
these tests, that is, their ability to reject the 
unit-root null hypothesis when it is indeed 


'This acceptance was not generally shared by gov- 
ernment and business economists. For example, the 
official series on potential output, which is used in 
policy and budget deliberations, has remained a smooth 
nonstochastic trend. 
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false.2 However, the indictment of low 
power against unit-root tests has not been a 
decisive criticism. The fact that unit-root 
tests may have low power against. ‘rtain TS 
alternatives does not necessarily compro- 
mise the results from those tests. It would 
not be at all surprising if unit-root tests had 
little power against TS alternatives that 
mimicked a DS model and reverted to trend 
extremely slowly; all statistical tests have 
low power against alternatives that are “‘lo- 
cal” to the null. The failure to reject near- 
unit-root TS alternatives is of little eco- 
nomic importance, however, because these 
alternatives are indistinguishable in eco- 
nomic terms from the DS null over the time 
horizons of practical macroeconomic inter- 
est (say, shorter than 10 years). Critics of 
unit-root tests must instead make the 
stronger claim that unit-root tests have low 
power against plausible TS alternatives that 
display substantially different macroeco- 
nomic behavior than a plausible DS null 
model. Simple power studies are not well 
suited to answering this question. 

This note addresses the argument that 
unit-root tests have low power only against 
local alternatives. The goal is to select the 
most plausible DS and TS representations 
for output, determine whether these repre- 
sentations have different short-run persis- 
tence properties, and then examine whether 
unit-root tests can distinguish between these 
models. In the next section, I examine the 
ordinary least-squares (OLS) estimates of a 
TS model and a DS model for the postwar 
sample of real GNP. These models are 
plausible representations of the data gener- 
ating process and yet imply very different 
economic dynamics at the horizons of eco- 
nomic relevance. In Section II, I examine 
simulated data from these models and show 
that a unit-root test cannot distinguish be- 


*Among many others, Bennett T. McCallum (1986), 
Francis X. Diebold and Rudebusch (1991), and David 
N. DeJong et al. (1992) have warned of the low power 
of unit-root tests. Other challenges have been made by 
DeJong and Whiteman (1991) from a Bayesian per- 
spective and by Perron (1989) in a framework with 
structural breaks. 
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tween these particular TS and DS represen- 
tations. Thus, the unit-root test has low 
power against a plausible TS model that is 
not local in economic terms to a plausible 
DS model. Section III extends the basic 
argument to account for small-sample bias 
in the estimated model coefficients. 


I. DS and TS Models of Real GNP 


Two obvious candidates for plausible TS 
and DS representations of the data generat- 
ing process for real GNP are simply the 
OLS estimates of these models from the 
available data sample. The data consist of 
quarterly observations on U.S. postwar log 
real GNP per capita (denoted Y,) from 
1948:3 to 1988:4. Assuming second-order 
dependence,* the sample OLS estimate of 
the TS model of aggregate output with a 
linear deterministic trend is 


(1) Y,=—0.321 + 0.00030¢+ 1.335Y,_, 
(0.109) (0.00010) (0.073) 


—0.401Y,_5 + u, 
(0.073) 


= 0.01013 


(standard errors of the coefficients appear 
in parentheses).* I will refer to this specific 
model estimate for the sample as the TSo; 5 
model. 

Under the assumption of a unit root, the 
DS model for this data sample is estimated 
in first differences as 


(2) AY, = 0.003 + 0.369AY,_, +0, 
(0.001) (0.074) 


= 0.01035 


This particular sample DS model will be 
denoted as the DS,;, model. 


‘The analysis was repeated assuming that the order 
of the model was four, six, and eight. Similar persis- 
tence and power results were obtained. 

*The last two quarters of 1948 were used as fixed 
initial conditions. 
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The estimated models (1) and (2) both 
appear to fit real GNP per capita fairly well; 
the standard deviations of their residuals 
are quite close, and plots of the residuals 
suggest no obvious outliers. In addition, Q 
statistics computed from the fitted residuals 
provide little evidence against the null hy- 
pothesis of no serial correlation at a variety 
of lags. 

However, the estimated TS,,, and <> 
models have very different implications for 
the persistence of the dynamic response of 
output to a random disturbance. To mea- 
sure this persistence, consider the moving- 
average representation for the first differ- 
ence of output implied by a TS or DS 
model: 


(3) 6,_,+... 


where k is some constant and e, is the 
innovation of the model. In this form, the 
sum of the a,’s measures the model re- 
sponse to a unit innovation.° A unit shock in 
period ¢ affects AY,,, by a, and affects 
---+a,. Thus, for 
various horizons, the cumulative response 
c, answers the question: how does a shock 
today affect the level of real output in the 
short, medium, and long run? With quar- 
terly data, for example, c,, measures the 
impact of a shock today on Y, five years 
hence. 

In the limit, the effect of a unit shock 
today on the level of output infinitely far in 
the future is given by c,,. For any TS series, 
c,,= 0, because the effect of any shock is 
eliminated as reversion to the deterministic 
trend eventually dominates. For a DS se- 
ries, c,,#0; that is, each shock has some 
permanent effect. However, the impulse re- 
sponse of real output at an infinite horizon 
is of no practical economic significance; in- 
deed, horizons of less than 10 years are 
usually of greatest interest. At these short 
horizons, the dynamic responses of TS and 


‘This measure of persistence is described further 
in Campbell and Mankiw (1987) and Diebold and 
Rudebusch (1989). 
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DS models may be quite similar or quite 
different depending on the values taken by 
the parameters of the models. Thus, the 
presence of a unit root determines whether 
C,, iS positive or zero, but it does not deter- 
mine all of the model properties of eco- 
nomic interest. It is in this sense that, as 
noted in the introduction, focusing solely on 
the existence of unit roots and on the power 
of unit-root tests against arbitrary TS alter- 
natives is insufficient. What is of economic 
relevance is the ability of unit-root tests to 
recognize when data have been generated 
from TS models that differ substantially 
from the DS model at short horizons (i.e., 
the ability to identify economically nonlocal 
alternatives). Consequently, a comparison of 
the persistence properties of the estimated 
TSors and DSo;, models at relevant hori- 
zons is required. 

The estimated model responses are shown 
in Table 1, with standard errors in paren- 
theses.° The impulse response of the DSo) 5 
model implies not only shock persistence 
but shock magnification. The effect of an 
innovation is not reversed through time, and 
it eventually increases the level of real GNP 
by more than one and a half times the size 
of the innovation = 1.59). In contrast, 
the TS model exhibits fairly rapid reversion 
to trend, with 85 percent of a shock dissi- 
pated after five years (c,) = 0.15). Thus, the 
cumulative impulse responses of these two 
models, each estimated from the same data 
sample, imply very different economic dy- 
namics at cyclical frequencies. Because the 
and DSo;, models of aggregate out- 
put have such different persistence proper- 
ties, it would be useful to have a test capa- 
ble of distinguishing between them. The 
next section explores the ability of one com- 
monly used unit-root test to accomplish this 
task. 


°These standard errors are calculated as follows. 
Let the cumulative impulse response at horizon h be 
given by c, = F(p),...,p,), and let f denote the vector 
of partials of F with respect to the parameters; then, 
the standard error equals yf’&f, where = is the esti- 
mated variance-covariance matrix of the autoregressive 
parameters. The standard errors account for parameter 
uncertainty but not for unit-root uncertainty. 
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TABLE 1—CUMULATIVE IMPULSE RESPONSES OF OLS MopELs 


Horizon (quarters) 


Model l 2 4 8 


12 16 20 


(0.07) (0.13) 


1.57 
(0.17) 


1.59 
(0.19) 


Mas 1% in 
(0.07) (0.13) (0.18) 


0.73 
(0.23) 


(0.19) 


(0.23) 


1.59 1.59 1.59 
(0.19) (0.19) 


0.43 


0.25 
(0.20) 


0.15 
(0.15) 


0.04 
(0.07) 


Note: Standard errors are given in parentheses. 


II. Application of a Unit-Root Test 


The augmented Dickey-Fuller unit-root 
test (David A. Dickey and Wayne F. Fuller, 
1981) is often used to try to distinguish a TS 
model from a DS model.’ For the second- 
order models under consideration, the aug- 
mented Dickey-Fuller regression takes the 
following form: 


(4) Y=A+ r+ 


Under the unit-root (or DS model) null 
hypothesis, 6=1; thus, the Dickey-Fuller 
test statistic is simply the ¢ test, 7 =(6 —1)/ 
SE(5), where SE(8) is the standard error 
of the estimated coefficient. 

For the postwar real GNP data under 
consideration, the sample value of the 
Dickey-Fuller test, which is denoted as Temp? 
is equal to —2.98. However, this statistic 
does not have the usual Student-t distribu- 
tion, but is skewed toward negative values. 
At the 10-percent significance level, Dickey 
and Fuller (1981) calculate the appropriate 
asymptotic critical value to be —3.12. Thus, 
the evidence from this sample, in accor- 
dance with the findings of previous re- 
searchers, suggests that the DS model for 
real GNP cannot be rejected at even the 
10-percent level. 


7Similar results to those below were also obtained 
with the Dickey-Fuller normalized-bias and 
likelihood-ratio tests, as well as with the generalized 
Phillips test (Phillips and Perron, 1988). 


However, the critical values provided by 
Dickey and Fuller (1981) for their aug- 
mented test are only valid asymptotically. In 
finite samples, the distribution of 7 will 
usually depend on the sample size and 
nuisance-parameter values (see e.g., Gene 
Evans and Savin, 1984). These factors can 
be taken into account by examining simu- 
lated data from the DSg,, model and calcu- 
lating the exact probability of obtaining the 
sample value of the test statistic from this 
particular null model. This ensures correct 
size for the test. More importantly, how- 
ever, by simulating the TS,,;, model, the 
exact probability of obtaining 7,,,,, from 
this particular alternative model can also be 
obtained. This allows correct assessment of 
test power against what is arguably one of 
the most interesting alternatives. 

The test-statistic probability distributions 
conditional on the OLS models are exhib- 
ited in Figure 1. The distribution of 7 con- 
ditional on the DSo,, model is denoted 
fps(*), while the distribution of 7 condi- 
tional on the TSo,, model is denoted 
fys(4). Each distribution is formed from 
10,000 realizations of the test statistic calcu- 
lated from 10,000 simulated data samples 
generated from the particular model.® The 
actual sample value of the test statistic 
(F.amp = — 2-98) is shown as a vertical dotted 
line. 


*The samples are generated with normal indepen- 
dently and identically distributed errors with sample 
size and initial conditions that matched those in equa- 
tions (1) and (2). 
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Ficure 1. EmpiricAt DENSITIES OF 7 FROM DSg; 5 AND TSo; 5 MODELS 


There are two areas in Figure 1 of special 
interest. The hatched area under f),(7) 
and to the left of 7...) represents the prob- 
ability of obtaining a value of the f¢ test 
equal to or smaller than — 2.98, conditional 
on the DS model of equation (2). This p 
value, is denoted as 


p value 


samp 


and represents the marginal significance 
level for rejection of the null hypothesis for 
the DSo;, model. This probability equals 
0.15; that is, given the sample test statistic, 
one could not reject the DS model at any- 
thing less than the 15-percent level in a 
classical hypothesis test. This is consistent 
with the usual inability to reject the DS 
model for real GNP at conventional signifi- 
cance levels. 

The other area of interest is the shaded 
region under f;,(7) and to the right of 
a oo This area represents the probability of 
obtaining a value of the ¢ test equal to or 
greater than — 2.98, conditional on the TS 


model of equation (1). This probability is 
denoted as 


TSors p value 
= prob(? = Famp!TSo.s model). 


For real GNP, the TS,; 5 p value is 0.22, so 
one would not be able to reject the esti- 
mated TS,o;, model at even the 20-percent 
significance level.” 

In short, the sample statistic for the aug- 
mented Dickey-Fuller test does not provide 
strong evidence against either the estimated 
DSo;s model or the TS,;, model for real 
GNP. Earlier papers that are unable to 
reject a unit root in output provide only one 
side of the relevant evidence for inference 
regarding the DS model. The other side, 
namely, the inability to reject a plausible TS 
model that exhibits transitory cyclical dy- 
namics of a traditional nature, is at least as 
convincing. 


“An equivalent statement of this result is that the 7 
test at the 15-percent significance level has power 
against the TSo;¢ alternative of only 78 percent. 


fog(7 ) 

A 

3 = prob(7 < model) 
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TABLE 2—PROPERTIES OF THE OLS EstTIMATE OF AN AR(1) MopDEL 


Statistic 0.40 0.60 0.80 5 0.90 0.95 1.00 


E(p,) 0.38 0.57 0.76 0.! 0.86 0.90 0.94 


prob(p, < p;) 0.61 0.65 0.72 7 0.81 0.89 0.99 


Note: Each column is based on 10,000 samples (each with 160 observations) drawn 
from an AR(1) with an autoregressive coefficient equal to p;. 


Ill. Unbiased DS and TS Models 


At first glance, the and TSo;s 
models might appear to be the most plausi- 
ble candidates for DS and TS representa- 
tions of the data generating process of real 
GNP. However, although the OLS esti- 
mates of these autoregressive models are 
consistent and asymptotically normal, they 
are biased in small samples because the 
presence of lagged dependent variables vio- 
lates the assumption of nonstochastic re- 
gressors in the classical linear regression 
model. This bias is easy to illustrate for the 
OLS estimate of the autoregressive parame- 
ter of an AR(1) process, 


(5) yt + 


Based on 10,000 samples of size 160 gener- 
ated from equation (5), Table 2 provides the 
mean value of the OLS estimate, p,, as well 
as the proportion of estimates that are less 
than the true value of p,.'° For example, if 
the true p, is equal to 0.95, the mean OLS 
estimate is 0.90, and 89 percent of the esti- 
mates are less than 0.95. The size of the 
autoregressive parameter bias that pushes 
the average OLS estimate below its true 
value varies with the value of the true pa- 
rameter, but it is most severe for near-unit- 
root models (i.e., those with p close to 1.0). 

A significant bias in the OLS sample esti- 
mates is potentially a serious shortcoming of 
the simulation methodology pursued in the 
previous two sections. In particular, the es- 


Table 2 is generated with p = y = x, = 0 and dis- 


turbances drawn from a standard normal distribution. 


timation bias implies that the sample OLS 
models used above probably understate the 
actual amount of persistence in real output. 
Suppose, for example, that an AR(1) TS 
representation like equation (5) was fit to 
real output, and the resulting OLS sample 
estimate , was equal to 0.90; consequently, 
the associated estimate of the 10-period cu- 
mulative impulse response, ¢,), would be 
0.34. Assuming that the OLS estimate was 
equal to its mean, E(p,), then the true 
parameter would be 0.95, and the actual 
value of c,, would be 0.60. Thus, in this 
case, the OLS sample estimate, on average, 
understates the amount of persistence. 
Arguably, more plausible candidates than 
the TS,, 5 and DSo,, models for the data- 
generating process would correct the pa- 
rameters for small-sample bias.'! First-order 
approximate bias corrections can be calcu- 
lated for both the TS and DS models quite 
easily. The bias of the DSo,, model, which 
is an ARQ) model with an unknown drift 
parameter, is treated in F. H. C. Marriott 
and J. A. Pope (1954). They show that, 
ignoring second-order terms, the expected 
value of the OLS estimate p, is given by 


(0) =p, -(+3p,)/T 


where 7 is the sample size. Substituting the 
sample OLS estimate for its expected value 


In this paper, I consider approximate mean- 
unbiased estimators of the TS and DS models. In 
Rudebusch (1992), I examine median-unbiased estima- 
tors obtained through repeated simulations, a proce- 
dure which would lead to qualitatively similar results if 
applied to the data set in this paper. 
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provides a bias-corrected estimator 
(7) (TA, +1) /(T -3). 


Note that, to a first-order approximation, 
this estimator is unbiased:!7 


E(p,) = pi. 


Applying (7) to the DS,o,, model for post- 
war real GNP given in (2), where p, = 0.369, 
I calculate the autoregressive coefficient of 
the bias-corrected DS model (denoted as 
the DS, model) to be p, = 0.383. 

Similarly, for the TS5;, model, which is 
an AR(2) with linear trend, the (first-order) 
bias in the estimated parameters can be 
determined using the results in Robert A. 
Stine and Paul Shaman (1989). Correcting 
for bias gives a TS,. model with autore- 
gressive coefficients: 


(8) 6, = (Tp, +2+2p,)/(T -1) 
for the one-period lag, and 
(9) = (Tp, +3)/(T 


for the two-period lag. For postwar real 
GNP, the TS,. model has autoregressive 
coefficients p, = 1.351 and p, = — 0.395. 
The bias-corrected and mod- 
els both display somewhat greater persis- 
tence than the and models. 
The bias correction embodied in the DS, 
model is in the same direction and of the 
same magnitude as the one suggested by 
Table 2. In particular, because the root of 
the associated lag-operator polynomial is so 
far from the unit circle, the bias of the 
model is quite modest.'? For the 


'2Of course, reduced bias does not necessarily en- 
sure that p, is a better estimator. However, Guy H. 
Orcutt and Herbert S. Winokur (1969) explore the 
properties of this estimator through simulations and 
find that it often has a smaller mean squared error 
than the OLS estimator. Theoretical results on this 
issue are provided in Hong-Ching Zhang (1989). 

“The bias in the DSo; 5 model is even smaller than 
the one in Table 2 because a linear trend is not 
estimated. With a linear trend, as in Table 2, the bias is 
given by E(p,)— p, = —(2+4p,)/T. 
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second-order and TS,. models, a 
useful metric with which to judge their 
closeness to a nonstationary model is simply 
the sum of the autoregressive coefficients 
(see e.g., Phillips, 1991). The sum of the 
OLS estimates p, and p, equals 0.933, while 
the sum of the bias-corrected coefficients p, 
and p, equals 0.955—a clear, though some- 
what small, shift toward nonstationarity.'* 

More specifically, the implications of the 
bias correction for judging the persistence 
of real GNP are given in Table 3, which 
contrasts the impulse responses of the DS, 
and TS models. The impulse response of 
the DS, model implies a shock persistence 
that is virtually indistinguishable from that 
of the DS, , model, which is not surprising 
given the trivial size of the coefficient bias. 
Moreover, the TS, model exhibits rever- 
sion to trend only slightly less rapid than 
that of the TSo,;, model. For the TSx- 
model, almost two-thirds of a shock is dissi- 
pated after five years (c) = 0.37). Most im- 
portantly, it remains true that the DS,, and 
TSyc candidates for plausible representa- 
tions of aggregate output have quite differ- 
ent implications about dynamic responses 
over fairly short horizons. 

As a final step, one can ask whether the 
augmented Dickey-Fuller test can distin- 
guish between these two models. Based on 
10,000 samples generated from the DSx- 
model,'> I obtain the probability of the sam- 
ple test statistic as 


DS xc p value 
= prob(? < F.amp!DSgc model) = 0.15. 


In contrast, the probability of the test 
statistic under the bias-corrected alternative 


‘The bias correction in the AR(2) TS model may 
appear to be surprisingly small in light of the large 
biases shown for the AR(1) model in Table 2. How- 
ever, as noted by Stine and Shaman (1989), the results 
for the AR(1) model do not generalize to higher-order 
models. Indeed, there are cases in which the bias (to a 
first-order approximation) moves the roots of the lag- 
operator polynomial closer to the unit circle. 

“The OLS estimates of the trend and intercept are 
used in generating data. 
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TABLE 3—CUMULATIVE IMPULSE RESPONSES OF BIAS-CORRECTED MODELS 


Horizon (quarters) 


8 


12 16 


1.62 


0.97 


1.62 


0.70 0.51 


model is 


p value 


= prob(7 > F.amp!TSgc model) = 0.24. 


These probabilities provide further confir- 
mation of the inability of unit-root tests to 
identify plausible TS alternative models for 
real output that display low persistence. In 
sum, the biases present in the OLS esti- 
mates are not substantial enough to change 
the conclusions from Section I and II. 


IV. Conclusion 


Based on the usual unit-root tests, little 
can be said about the relative likelihood of 
the specific DS and TS models of real GNP 
given above. In particular, plausible TS al- 
ternatives that fit the data cannot be de- 
tected because of low test power. The ap- 
propriate conclusion from unit-root tests on 
this data sample is that the existence of a 
unit root is uncertain. 

Furthermore, the unit-root tests em- 
ployed display low power against plausible 
TS models that display quite different eco- 
nomic behavior than a plausible DS null 
model. If the TS models described above 
correctly portray the persistence of real 
GNP, then the DS model does not provide 
a good approximation of the dynamic re- 
sponse of output at even short-run cyclical 
horizons of, say, five years. 

In light of the impulse responses of the 
TS models, the appropriate confidence in- 
tervals for estimates of short-term or 
medium-term persistence are much larger 
than ones given conditional on the existence 
of a unit root. This suggests the importance 
of measuring the confidence intervals for 
estimates of persistence without condition- 
ing on the TS or DS model. In Diebold and 


Rudebusch (1989), approximate estimates of 
such intervals are obtained by using a model 
of fractional integration that nests the TS 
and DS models. Stock (1991) also provides a 
step in this direction by obtaining confi- 
dence intervals for the largest autoregres- 
sive root. 

In sum, the evidence in this paper and in 
other recent work, notably Lawrence J. 
Christiano and Martin Eichenbaum (1990), 
suggests that a new consensus should be 
formed that stresses the uncertainty about 
the existence of a unit root in real output 
and the uncertainty about the amount of 
persistence of macroeconomic shocks. 
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A Note on Competitive Investment under Uncertainty 


By Rosert S. Pinpyck* 


Uncertainty over future output prices or 
input costs can affect investment by a risk- 
neutral firm in two opposing ways. First, it 
can increase the value of the marginal unit 
of capital, which leads to more investment. 
This only requires that the stream of future 
profits generated by the marginal unit be a 
convex function of the stochastic variable; 
by Jensen’s inequality, the expected present 
value of that stream is increased. This result 
was demonstrated by Richard Hartman 
(1972) and later extended by Andrew Abel 
(1983) and others. In their models, constant 
returns to scale and the substitutability of 
capital with other factors ensure that the 
marginal profitability of capital is convex in 
output price and input costs. Even with 
fixed proportions, however, this convexity 
can result from the ability of the firm to 
vary output, so that the marginal unit of 
capital need not be utilized at times when 
the output price is low or input costs are 
high.! 

If investment is irreversible and can be 
postponed, a second effect of uncertainty is 
to create an opportunity cost of investing 
now, rather than waiting for new informa- 
tion to arrive before committing resources. 
This increases the full cost of investing in a 
marginal unit of capital, which reduces in- 
vestment.? Hence the net effect of uncer- 


* Massachusetts Institute of Technology, Cambridge, 
MA 02139. This work was supported by the National 
Science Foundation under Grant No. SES-8618502, 
and by MIT’s Center for Energy Policy Research. My 
thanks to Julio Rotemberg and especially Ricardo 
Caballero for very helpful comments and discussions. 

'Then the marginal profit of capital at each time 1 
in the future is max[0,(p,—c,)], where c, is variable 
cost. Thus, a unit of capital represents a set of all 
options on future production, which are worth more 
the greater the variance of p, or c,. 

*For a detailed discussion of this point and a survey 
of the recent literature on irreversibility and its impli- 
cations for investment and market evolution, see 
Pindyck (1991). 


tainty on irreversible investment depends 
on the size of this opportunity cost relative 
to the increase in the value of the marginal 
unit of capital. The sign of the net effect is 
significant because of its possible policy im- 
plications. Pindyck (1988) and Giuseppe 
Bertola (1989) developed models in which a 
firm faces a downward-sloping demand 
curve and found the net effect to be nega- 
tive. 

Ricardo Caballero (1991) has recently 
studied investment by an individual firm 
facing a_ stochastically shifting demand 
curve. He shows that, as returns to scale 
become constant and the elasticity of de- 
mand faced by the firm rises, this opportu- 
nity cost of investing now rather than wait- 
ing approaches zero. Thus, in the limit of 
constant returns and an infinitely elastic de- 
mand curve, an increase in uncertainty tends 
to raise rather than decrease current invest- 
ment, even if that investment is irreversible. 

It is important to stress that Caballero’s 
limiting result treats the firm in isola- 
tion and describes the effect of a mean- 
preserving increase in price uncertainty, and 
not an increase in industry-wide demand 
uncertainty. This note shows that, in a com- 
petitive market, the key interactions be- 
tween irreversibility and uncertainty occur 
at the industry level and can only be under- 
stood by making price and industry output 
endogenous.’ Doing so restores the positive 
opportunity cost associated with irreversible 
investment. 

In addition, Caballero’s result is based on 
a model with convex adjustment costs. An 
innovative aspect of the model is that these 
costs can be asymmetric, thereby allowing 
for partial or complete irreversibility; com- 
plete irreversibility corresponds to a cost of 


Studies that make price endogenous include Steven 
Lippman and Richard Rumelt (1985), Avinash Dixit 
(1989, 1991), and John Leahy (1990). 
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downward adjustment that is infinite. The 
model is particularly useful in that it allows 
one to study the sensitivity of investment to 
the extent of asymmetry in adjustment costs. 
However, as with other models of this kind, 
the size of the firm would be unbounded 
were it not for adjustment costs: it is only 
adjustment costs that determine firm size. 
This role of adjustment costs is crucial to 
the results of Caballero and earlier authors 
regarding the effect of uncertainty on in- 
vestment. This note also shows that, while 
helpful for studying the behavior of a firm 
in isolation, this adjustment-cost framework 
(used widely in empirical studies of invest- 
ment) is inconsistent with a competitive 
market equilibrium and, hence, with the 
behavior of a competitive firm. 


I. Adjustment Costs, Competition, and 
Irreversible Investment 


A firm that has constant returns to scale 
everywhere and faces an infinitely elastic 
demand curve will have a profit function 
that is linear in the capital stock. Hence, 
convex costs of some kind are needed to 
bound the size of the firm; otherwise, the 
firm would expand indefinitely if the pres- 
ent value of the stream of incremental prof- 
its from a unit of capital exceeded the cost 
of the unit. In most adjustment-cost models 
of competitive investment, the convexity of 
adjustment costs limits the size of the firm 
by making the marginal cost of investment 
an increasing function of the level of invest- 
ment. However, because adjustment costs 
are a function of only the level of invest- 
ment (and not the stock of capital), invest- 
ment in each period is independent of in- 
vestment or the stock of capital in any other 
period. Hence, there are no linkages be- 
tween decisions in one period and the next. 

This, however, necessarily eliminates irre- 
versibility from the problem. Irreversibility 
affects today’s decisions when it causes those 
decisions to constrain behavior in the future 
under some states of nature but not under 
others. For example, consider the invest- 
ment decision of a firm facing an uncertain 
future demand. If the firm invests a large 
amount this period, it would not want to 
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disinvest next period if demand expands 
and so would not be constrained by irre- 
versibility. However, the firm would be con- 
strained if demand were to contract next 
period, because then it would want to disin- 
vest and reduce its capital stock. Because 
next period’s demand is uncertain, irre- 
versibility leads the firm to invest somewhat 
less this period. 

This effect can never arise when the size 
of the firm is constrained only by adjust- 
ment costs. Then, investment next period 
depends only on the realization of demand 
that period and on the adjustment-cost 
function; it is completely independent of 
investment this period. Hence the firm need 
only compare the marginal cost of investing 
to current and expected future marginal 
profits. If uncertainty increases expected fu- 
ture marginal profits, it will necessarily in- 
crease current investment. 

Convex adjustment costs may indeed af- 
fect the rate at which firms invest (although 
simple “time to build” and the iumpiness of 
investment are likely to be more important 
constraints). It is unrealistic, however, to 
treat adjustment costs as the sole or main 
determinant of firm and industry size in 
equilibrium. In fact, a pure adjustment-cost 
model with constant returns to scale is in- 
consistent with a competitive market equi- 
librium. The reason is that, with free entry, 
a very large number of very small firms will 
come into the industry. (Very small firms 
would enter because they would have very 
small adjustment costs and hence lower to- 
tal costs.) In the limit, the industry would be 
composed of an infinite number of in- 
finitesimally small firms, and so each firm 
would have no adjustment costs. 

Even if each firm is constrained to some 
minimum size, the possibility of entry by 
new firms or expansion of existing ones will 
ensure that investment decisions in the face 
of industry-wide uncertainty are intertem- 
porally linked. As a result, industry-wide 
uncertainty will affect irreversible invest- 
ment by a competitive firm with constant 
returns to scale much as it would a noncom- 
petitive firm or a firm with decreasing re- 
turns. The reason is that, in each period, if 
demand increases existing firms will expand 
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or new firms will enter until the market 
clears. From the point of view of an individ- 
ual firm, this limits the amount that price 
can rise under good industry-demand out- 
comes. However, if investment is irre- 
versible, there is no similar mechanism to 
prevent price from falling under bad de- 
mand outcomes. Each firm takes price as 
given, but it knows that the distribution of 
future prices is affected by the irreversibility 
of investment industry-wide. This reduces 
its own incentive to invest. 

Thus, in a competitive equilibrium, un- 
certainty over market demand affects invest- 
ment through the feedback of industry-wide 
capacity expansion and new entry on the 
distribution of prices.* The following exam- 
ple illustrates this with a simple two-period 
model that allows for this feedback. 


II. An Example 


As in Caballero’s (1991) model, each firm 
is in place two periods, there is no deprecia- 
tion or discounting, and the production 
function is Cobb-Douglas: 


(1) q;= AL{K;~* 


0<a<l 


where gq; is the output of firm 7. Since I 
concentrate on competitive firms facing 
industry-wide shocks, I begin with the mar- 
ket demand curve, rather than the demand 
curve facing an individual firm, and assume 
that it is isoelastic: 


(2) 


where e is the elasticity of demand, and Z, 
is a stochastic process, with Z, = 1. For sim- 
plicity, let Z, equal 0 or 2 with equal proba- 
bility. IT will compare this to the certainty 
case in which Z, = 1. Also, I will restrict the 
discussion to the case of complete irre- 
versibility, with no cost of adjusting K; up- 
ward. 


‘This can also be the case under imperfect competi- 
tion. 
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Let there be a large number, N, of 
equal-size firms, so that each takes price as 
given, and Q = Nq,. The profit function for 
each firm is then 


(3) = hP"K; 


where h = (1— a) /w)*/"- and 
n=1/(1—a)>1. Also, q;= BK;, where B 
=h/(i1—a). With no loss of generality, I 
choose A so that B = 1. Note that the value 
of a marginal unit of capital is hP”, what- 
ever the firm or industry capital stock. This 
value is convex in P, so its expectation is 
increased by a mean-preserving spread in 
P; but as will be seen, this need not mean 
that uncertainty leads the firm to invest 
more. 

First, consider the certainty case. Here, 
P, = P,, and all investment occurs in period 
1. Each firm will want to invest an infinite 
amount if 2hP/’>k and will invest nothing 
if 2hP? <k, where k is the cost of a unit of 
capital. Thus, in equilibrium, firms invest 
until price falls to the point that 2AP])' =k. 
Hence P,=P,=(k/2h)'/". Industry in- 
vestment in the first period is /,= K,= 
Q, =(2h/k)*/". (Each firm’s investment is 
just 1/N of this.) 

Now suppose that Z, is unknown when 
firms invest in period 1; it can turn out to be 
2 or 0, each with probability 0.5. Although 
Z, is exogenous, P, is determined as part of 
the market equilibrium. To find this equilib- 
rium, one wants a distribution for P, that 
results from Z, and from firms’ investment 
decisions, with those decisions based on this 
same distribution.° 

I will surmise that equilibrium investment 
in period 1 is small enough so that, in 
period 2, firms invest some positive amount 
if Z,=2 (whereas they invest nothing if 
Z,=0). After solving for J,, I will check 
that this is indeed the case. Then, if Z, = 2, 
firms will invest in period 2 to the point that 
the profit from a unit of capital equals its 
cost, that is, until =k, or P,= 
(k /h)'/”. This implies that K, will equal 


This is easy to do for this simple example. Leahy 
(1990) solves a more general continuous-time problem. 
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(P, /2)~* =2°(h/k)*/". Of course if Z, = 
0, then P, =0, J, =0, and K,=K,. 

Given this distribution for price in period 
2, risk-neutral firms will invest in period 1 to 
the point that the expected value of a unit 
of capital equals its cost: 


(4) + E,\(hP?) =k 


or 
+0.5k =k. 


Hence J, = K,=(2h/k)*/”. Finally, I will 
check that J, is indeed positive if Z, = 2. If 
Z,=2, then 


I, = K,=(2° -2°/")(h/k)*/">0 


since 7 > 1. 

In contrast to Caballero’s (1991) result, I 
have found that period-1 investment is the 
same when Z, is uncertain as it is when 
Z,=1 with certainty. The reason is that 
while a mean-preserving spread in the dis- 
tribution of P, increases the value of a unit 
of capital, a mean-preserving spread in Z, 
reduces the expected value of P,. The equi- 
librium response of firms limits price in- 
creases under good outcomes of Z,, but 
because of the irreversibility of investment, 
it does not limit price decreases under bad 
outcomes. In this particular example these 
two effects just offset each other, so invest- 
ment is left unchanged.° 

The Appendix extends this example to n 
periods and allows Z, to follow a random 
walk; it begins at 1 and increases or de- 
creases by 100 percent in each period. The 
Appendix shows that investment in period 1 
is lower when Z, is stochastic, as long as 
n> 3. Also, in any period, the difference 
between investment when future values of 
Z, are known and investment when they are 
stochastic grows with the number of periods 
remaining. The reason is that the variance 
of future values of Z, increases with the 


°If Z,=1 with certainty, P,;=(k /2h)'/”, but if 
Z,=0or 2, = /h)'/", which is smaller since 
7 > 1. The expected marginal profit of capital is k /2 in 
both cases. 
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time horizon, but industry investment al- 
ways limits price increases under good out- 
comes.’ 


Concluding Remarks 


The simple example presented above 
shows how industry-wide uncertainty can 
have a negative effect on irreversible invest- 
ment, even when firms are perfectly com- 
petitive and have constant returns to scale. 
That effect is mediated by the equilibrium 
behavior of all firms and the resulting im- 
pact on market price. In the two-period 
example above, that effect just offsets the 
increase in the value of a unit of capital that 
results from the convexity of the marginal 
profit function, so that period-1 investment 
is left unchanged by a mean-preserving 
spread in the demand shift variable. When 
the number of periods exceeds two, period-1 
investment is lower when future demand is 
uncertain. 

In the example, I found the equilibrium 
distribution for price and the levels of in- 
vestment in each period consistent with that 
distribution. Alternatively, I could have used 
the fact, demonstrated by Robert Lucas and 
Edward Prescott (1971), that the competi- 
tive equilibrium is the solution to the 
social-planning problem. The social planner 
will use the downward-sloping demand curve 
to calculate the optimal investment rule and 
hence will solve an _ optimal-investment 
problem that has the same structure as that 
of a monopolist with constant returns to 
scale. 

The results in this note do not imply that 
industry-wide uncertainty will have identical 
effects on investment for a competitive firm 
and a monopolist. In Caballero and Pindyck 
(1992), a model with entry and exit is devel- 
oped that helps clarify the differences be- 
tween the competitive firm’s response to 


’Like the two-period example, the n-period example 
in the Appendix ignores depreciation and discounting. 
Hence if Z, =1 for all ¢, the value of a unit of capital 
grows linearly with n, but it grows less rapidly if Z, is 
stochastic and expected future prices are lower. Includ- 
ing depreciation and discounting would reduce the 
depressive effect of uncertainty on current investment. 
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industry-wide uncertainty and that of a 
monopolist and also shows the effects of 
firm-level uncertainty (e.g., over the firm’s 
productivity). 


APPENDIX 


This appendix extends the two-period ex- 
ample to n periods, where Z, = 1 and then, 
in each succeeding period, Z, increases or 
decreases by 100 percent, with probability : 
for each. Thus, Z,=0 or 2. If Z,=0, Z, 
remains 0 for all future ¢, but if Z,=2, 
then Z,=0 or 4, and so on. As before, 
there is no depreciation or discounting. 
Hence in the certainty case (Z, = | always), 
firms invest in period | to the point that 


(Al) nhP}? = nh(K,'/*) =k 
= K,=(nh/k)*’", 1, =0 for t > 1, and 
=(k/nh)'”". 

I will again find a solution for the 
stochastic case by surmising that investment 
is positive in a good state (i.e., when Z, , , > 
Z,) and then checking that this is indeed the 
case. First, in period n, the good state is 
that in which Z, = 2”~'. In this state, firms 
invest until P,=(k/h)'/". Thus K, = 
(P, = and I, = K,, 


In period n —1, in the good state Z,,_, = 
2"-?, and firms invest to the point that 
AP”, + (hP”) =k, which implies that 


h(2"-?K7 4°)" + 0.5k =k 


n—1 


or 


Note that since 7 > 1, K,, > K,,_, in a good 
state, aS was surmised. In period n —2, in 
the good state, firms invest to the point that 
hP,”_, + En-2[hP"_,+ hP"])=k, so that 
K,, > =[2™-?*'h/k}/". In general, in a 
good state, 
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Finally, working back to period 1, J, = K,= 
(2h/k)°/". Note that this is smaller than 
the certainty case when n> 2. 
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Bargaining versus Price Competition in Markets 
with Quality Uncertainty 


By Hetmut BEsTER* 


This paper focuses on the endogenous 
determination of trading rules. In many 
markets, as for example in the retail busi- 
ness, prices are simply posted by sellers, and 
the buyer has little direct influence on how 
much he has to pay. In other markets prices 
are the outcome of bilateral negotiations, so 
that both the seller and the buyer take an 
active part in setting the price. Examples 
include not only the bazaar of a less devel- 
oped nation, but also the markets for used 
cars, real estate, antiques, and inputs for 
manufacturing firms. This paper provides a 
theoretical explanation of which pricing in- 
stitution is likely to emerge in a market 
where buyers are imperfectly informed 
about the quality of goods or services. I 
compare the performance characteristics of 
posted-offer pricing with negotiated pricing 
and find that each arrangement has specific 
merits. These determine the equilibrium 
pricing policy as the outcome of competitive 
interactions between the market partici- 
pants. 

The key insight of my analysis is that the 
price-determination aspects of market orga- 
nization can also affect quality. Suppose the 
buyer has to visit a firm to determine its 
choice of product quality and that he expe- 
riences switching costs when moving from 
one seller to another. The incentive to pro- 
vide quality in the posted-price regime is 
that the consumer may walk away to an- 
other store if the quality is too low. Switch- 
ing costs may undermine this incentive be- 
cause they create a lock-in effect. A seller 
who has a locked-in customer may reduce 
his cost by choosing a lower quality. This 


*Center for Economic Research, Tilburg University, 
P.O. Box 90153, 5000 LE Tilburg, The Netherlands. I 
thank Dilip Mookherjee, Larry Samuelson, Jonathan 
Thomas, Eric van Damme, and two referees for their 
comments. 


argument is often used to advocate self- 
enforced bans on price advertising for 
providers of professional services such as 
doctors and lawyers.'! While posted-pricing 
promotes ex ante price competition, it may 
lead to a deterioration in product quality. 
In contrast with posted pricing, bargain- 
ing determines the price of the good after 
the buyer has arrived at a store and learned 
its quality. Here the incentive to provide 
quality is generated by the fact that the 
seller gets some fraction of the total sur- 
plus. He will seek to maximize this surplus 
by selecting the socially efficient quality. As 
a result, the negotiated-price market does 
not exhibit the moral-hazard problem that 
characterizes the posted-price market. 
However, the locked-in consumer finds him- 
self in a situation of partial bilateral 
monopoly with the seller. This allows the 
seller to exploit his customer, and the bar- 
gaining may result in a relatively high price. 
The different impact of switching costs on 
price and quality in the posted- and the 
negotiated-price markets determines the 
competitiveness of these trading rules.? 
There is a considerable literature study- 
ing the formation of prices in decentralized 
markets where pairs of agents bargain over 
the gains from trade.* This literature takes 


'Yuk-Shee Chan and Hayne Leland (1982) and 
William Rogerson (1988) examine this argument. 

For a comparison of posted and negotiated pricing 
in a laboratory experiment, see James Hong and 
Charles Plott (1982). 

>This includes work by Peter Diamond and Eric 
Maskin (1979), Ariel Rubinstein and Asher Wolinsky 
(1985), Douglas Gale (1986a,b), and myself (1988). A 
different context is considered in my 1989 paper, where 
I replace the price-setting stage of the standard spatial 
competition model with a bargaining game. Further 
references and a detailed discussion of bargaining in a 
market setting are found in the monograph by Martin 
Osborne and Rubinstein (1990). 
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the trading rule as exogenously given; the 
sellers are prohibited from competing with 
each other by posting prices. The optimal 
selling strategy of a monopolistic seller is 
studied by John Riley and Richard Zeck- 
hauser (1983) and by Drew Fudenberg et al. 
(1987). Their analysis is concerned with the 
question of whether a fixed posted price 
yields a higher payoff for the seller than a 
haggling strategy. Beginning with the work 
of William Vickrey (1961), the choice of 
selling mechanisms is an important topic 
also in the auction literature.* It turns out 
that posting a fixed price is the optimal 
strategy for a monopolistic seller of a single 
good when he faces a single potential buyer. 
Similarly, with many buyers, posted pricing 
is optimal for the monopolist when he pro- 
duces the good under constant returns to 
scale and can freely vary the amount offered 
for sale (see Milton Harris and Arthur Ra- 
viv, 1981la,b). Paul Milgrom (1987) embeds 
the auction model in a noncooperative- 
bargaining model in which the owner of the 
good always has the right to resell the item 
to other traders. He finds that conducting 
an auction allows a weak bargainer to bene- 
fit from the presence of stronger bargainers, 
so that typically the sale will be by auction 
rather than by private offer. The following 
features distinguish my model from this lit- 
erature: there are many producers operat- 
ing under constant returns to scale; these 
producers can compete by advertising prices; 
buyers are imperfectly informed about 
product quality; and buyers face switching 
costs and cannot negotiate with several sell- 
ers at the same time. 

To address the problem, I study a simple 
model that allows one to derive an equilib- 
rium solution both for negotiated and posted 
pricing. It is presented in Section I. Sections 
II and III investigate the equilibrium out- 
come under both trading rules. Based on 
this analysis, I endogenize the determina- 
tion of trading rules in Section IV, where I 


4See Preston McAfee and John McMillan (1987) 
and Paul Milgrom (1987) for a survey. 
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show that for each parameter constellation 
there is a unique equilibrium pricing mech- 
anism. Concluding remarks are contained in 
Section V. 


I. The Modei 


Consider a market with N> 2 identical 
firms. Each firm produces a single good at 
constant returns to scale. Before the market 
opens, each firm decides once-and-for-all 
on the quality q€{q,,q,} of its output, 
where q, >q,. The cost of producing one 
unit of quality q is c(q) with c(q,)>c(q,). 
In the model all consumers are identical. 
They do not interact strategically with each 
other. This together with the assumption of 
constant returns to scale allows one to con- 
sider each buyer in isolation independently 
of the total number of consumers. Each 
consumer purchases at most one unit of the 
good. His utility from purchasing quality q 
at the price p is given by q-— p. Alterna- 
tively, he may not purchase the good from 
any of the N firms and consume some “out- 
side good” instead. The price and the qual- 
ity of the outside good are exogenously fixed 
so that the consumer enjoys the net benefit 
v from buying it. 

The buyer does not directly observe the 
firms’ choice of quality. He learns the qual- 
ity g sold at a particular store only by visit- 
ing the store. There is a cost to visiting a 
store. Switching from one of the WN sellers 
to another or to consuming the outside good 
takes one time unit. As the buyer discounts 
future benefits by the discount factor 0 < 
5 <1, this creates a switching cost. We will 
view 5 as a measure of these costs and 
investigate its impact on the formation of 
prices in this market. This is done under the 
following assumption: 


(1) 4,—¢(4n) > > 4,7) > 9. 


Thus, in the full-information equilibrium 
with perfect competition all firms would 
produce quality q,, and the consumer would 
buy the high-quality good at the price p= 
c(q,,). Consuming the outside good would 
yield a lower utility level. In addition, the 
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surplus from producing the low-quality good 
is taken to be too low to compete with the 
outside good. This implies that under im- 
perfect information the buyer will never visit 
a store that he suspects to offer quality q,. 
Accordingly, I can confine the analysis to 
situations where the sellers find choosing 
quality q,, to be optimal. 


II. The Negotiated-Price Market 


In the negotiated-price market the con- 
sumer has the option of purchasing the out- 
side good or visiting one of the N stores to 
bargain about the price of the good. Upon 
entering a store, he observes the quality g 
actually chosen by the seller and so the 
price negotiations proceed under symmetric 
information. The “disagreement point” in 
this bilateral bargaining situation represents 
the payoffs of the buyer and the seller, 
respectively, if no sale takes place and the 
buyer quits; it will be denoted as (d,0). Of 
course, the buyer’s payoff d depends upon 
his switching cost and the net benefit that 
he expects from bargaining with another 
seller or simply from consuming the outside 
good. Accordingly, in equilibrium, d will be 
determined endogenously. The seller’s profit 
from not making a sale is zero. He keeps a 
constant inventory of the good so that after 
each sale he incurs the cost c(q) of produc- 
ing one unit. Suppose the generalized Nash 
bargaining solution is the outcome of the 
bargaining.” Then the price upon which the 
parties agree is® 


a 


(2) (4,d) =argmax,[q— p-d] 


°The generalized Nash solution is studied by John 
Harsanyi and Reinhard Selten (1972). An axiomatiza- 
tion of this solution is given by Ehud Kalai (1977) and 
Alvin Roth (1979). Kenneth Binmore et al. (1986) 
derive the generalized Nash bargaining solution as the 
noncooperative equilibrium of an extensive game in 
which the parties alternate in making offers and coun- 
teroffers. 

°Of course, an agreement will be reached only if the 
surplus q —c(q)—d is nonnegative. This condition is 
always fulfilled in the equilibrium defined below. 
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with 0 <a<1. This solution splits the sur- 
plus so that the buyer receives the fraction 
a. The parameter a may therefore be inter- 
preted as expressing the buyer’s “bargaining 
power’; by varying a from zero to unity, 
one can obtain any price that is individually 
rational both for the buyer and the seller. 
Much of my analysis will focus on the joint 
impact of the bargaining parameter a and 
the friction parameter 6 on the market out- 
come. 

In the event of breakdown in the negotia- 
tions, the buyer can either switch to another 
bargaining partner or he can purchase the 
outside good. As the surplus gq, —c(q,) is 
less than v, the buyer will not go to one of 
the N stores unless he is convinced that he 
will find quality q,.’ In the equilibrium of 
the negotiated-price market the buyer ex- 
pects high quality, and he anticipates that 
the bargaining will result in some price p. 
Given these expectations and the delay cost 
of switching, his expected utility from dis- 
agreement is 


(3)  d(p) =max[5v,6(4,— p)]. 


In equilibrium the consumer’s price—quality 
expectations have to be consistent with the 
market outcome. 


Definition: p is a negotiated-price equilib- 
rium if (i) q,-p2v and p>c(q,); (ii) 
p=(q,,d) and d=d(p); and (iii) p- 
c(q,,) = ¢(q,,d)— c(q,). 


The first of these conditions ensures that 
both the sellers and the buyers are willing 
to participate in the market. If (i) fails to 
hold, then none of the N sellers is active, 
and the consumer purchases the outside 
good. By (ii), the equilibrium price p is 
determined by the bargaining solution, tak- 
ing into account that the buyer’s threat point 


“As will be shown below, choosing qy, is a dominant 
strategy for each seller. This precludes the possibility 
of a mixed equilibrium in which some sellers choose q,, 
and others choose q,. 
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III IV 


FiGurRE 1. NEGOTIATED PRICING Is AN 
EQUILIBRIUM IN Reaions II AND IV; PosTED 
PrictnG Is AN EQUILIBRIUM IN REGIONS I AND II 


in the price negotiation is d(p). Finally, (iii) 
guarantees that each single seller finds it in 
his own interest to select quality q,. In 
equilibrium, the buyer is indifferent among 
all stores so that the sellers share the mar- 
ket equally. To state the main result of this 
section, I define the function 


v—alq,—c(4y)] 
(l-a)v 


(4) = 


By assumption (1), 65,(+) is decreasing, 
6,(0)=1, and 6,(a)>0 for all a<vu/ 
— c(q,)]. In Figure 1 the function 6= 
5,(a@) represents the borderline between the 
two regions I+ III and II+IV. 


PROPOSITION 1: There is a_ negotiated- 
price equilibrium if and only if 5 > 6,(a). If 
5 > 6,(a@), the negotiated-price equilibrium is 
unique with 


(1—a)(1—54)q, + ac(q,) 
1—(l-a)é 


p= 


PROOF: 
The generalized Nash bargaining solution 
is defined by the necessary and sufficient 
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first-order condition 


(5) 


Accordingly, 
o(q,d)—c(q) 


so that by assumption (1) equilibrium condi- 
tion (iii) is satisfied for any p satisfying 
condition (ii). By the first inequality of con- 
dition (i), one must have d(p) = 6(q, — p). 
Therefore, solving (ii) for p yields the 
unique solution stated in the proposition. 
This solution always satisfies the second in- 
equality in (i); the first inequality in (i) is 
identical to alq, —c,]/[1—( — a)6]= v. By 
the definition of 6,, this is equivalent to 
5 > 6,(a@). 


The inequality 6>6,(q@) is satisfied in 
regions II and IV of Figure 1. For these 
parameter constellations, the consumer pur- 
chases the high-quality good at the price p. 
As p exceeds c(q,,), the presence of market 
frictions enables the sellers to earn positive 
profits. These are higher, the lower are 6 
and a. Interestingly, p approaches c(q,) 
both in the limit when 6—1 and in the 
limit when a > 1. The first of these proper- 
ties justifies viewing the perfectly competi- 
tive outcome as the limiting point of a mar- 
ket with negligible switching costs. 

Why does the consumer purchase the 
outside good for values of 6 and a in re- 
gions I and III, where a negotiated-price 
equilibrium fails to exist? The reason is that 
these parameter values violate equilibrium 
condition (i). With high switching costs and 
little bargaining power the buyer cannot get 
a favorable deal once he has entered a 
store. Knowing this ex ante keeps him from 
going to a store and induces him to con- 
sume the outside good. This kind of market 
failure is well known from the standard 
search model (see e.g., Joseph Stiglitz, 1979) 
in which prices are unilaterally set by sell- 
ers, and buyers have to visit stores to ob- 
serve prices. Indeed, in the limiting situa- 
tion where a=0O the negotiated-price 


= 
=— 


market becomes identical to the simplest 
search model with identical buyers and sell- 
ers. When a=0 the seller has all the 
bargaining power and effectively makes a 
take-it-or-leave-it offer to his customers. 
Proposition 1 shows that in this case p = q,, 
(i.e., each seller would charge the con- 
sumer’s reservation price). This observation 
is the well-known “monopoly price paradox” 
of Diamond (1971): even with arbitrarily 
small search costs and a large number of 
firms, in market equilibrium the price is the 
monopoly price. If the buyer has to pay a 
cost for entering the market or can opt for 
some outside good, this paradox predicts 
total market failure. In regions II and IV 
this unsatisfactory outcome is avoided be- 
cause a is large enough to guarantee the 
buyer a sufficient share of the gains from 
trade. 

Importantly, the proof of Proposition 1 
reveals that the negotiated-price market in- 
volves no problem of moral hazard; that is, 
the incentive constraint (iii) is never bind- 
ing. This perhaps surprising observation has 
a simple intuition: the bargaining outcome 
guarantees the seller a fraction (1— a) of 
the bargaining surplus. As a result, he is 
always better off by producing gq, because 
this quality yields a higher surplus than does 
q,- Choosing quality q,, is a dominant strat- 
egy for the seller in the negotiated-price 
market.’ This fact distinguishes this market 
from the posted-price market, which I turn 
to in the next section, where prices are set 
before the consumer becomes aware of qual- 
ities. 


*In the literature, the typical way to avoid the 
Diamond result has been to assume that some con- 
sumers are well informed about prices (see Steven 
Salop and Stiglitz, 1977; Hal Varian, 1980; Dale Stahl, 
1989). 

This conclusion is robust against affine transforma- 
tions of the seller’s and buyer’s utilities. The reason is 
that the generalized Nash bargaining solution satisfies 
the axiom of independence of utility rescalings. 

The relationship between ex ante and ex post 
pricing in a model without qualitative uncertainty is 
explored by Gale (1988). 
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III. The Posted-Price Market 


In the posted-price market, the sellers act 
as Bertrand competitors by posting prices. 
Advertising price information enables the 
seller to guarantee his customers a price 
before they visit his store. In contrast we 
assume that communication of quality infor- 
mation is infeasible. This assumption is rel- 
evant in markets where quality is either too 
costly to communicate or is not verifiable to 
third parties. If a court finds it hard to 
determine whether a seller actually provides 
“high” quality, then buyers must distrust 
quality advertisements because they cannot 
be enforced. 

The buyer observes the sellers’ price quo- 
tations and compares their attractiveness 
with the outside-option utility v. After en- 
tering a store and learning its quality he can 
either make a purchase at the posted price 
or switch to another seller. By assumption 
(1), he will not go to a store if he anticipates 
finding quality q,.'' In the posted-price 
equilibrium the buyer expects q, in each 
store, and so all sellers post the same price 
p*. As all stores appear identical to the 
buyer, he visits one of them at random. To 
confirm his expectations, competition must 
induce the suppliers to offer quality q,.'? As 
q is not directly observable, each single seller 
has an incentive to select q, at the posted 
price p* only if the buyer would quit after 
observing quality q,. Given his expectations 
about qualities at other stores, the buyer 
will certainly do so if q, — p* <d(p*). In 
fact, we will assume that the buyer refuses 
to purchase low quality unless he is not 
actually worse off by departing.'* This means 


Of course, the buyer presumes that no seller of- 
fers the good at a price below cost. 

!27 assume that each seller selects g, when he is 
indifferent between q,, and 

“This tie-breaking rule is necessary to avoid the 
open-set problem that would occur if there were no 
lowest price that signals high quality. Indeed, equilib- 
rium condition (iii) in the next (posted-price equilib- 
rium) definition is impossible to satisfy if condition (ii) 
is replaced by p* > 2(p*). 
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that each single seller will choose quality q,, 
if and only if the equilibrium price p* satis- 
fies the restriction p* > 7( p*), where 


(6) 


While p* > 7(p*) ensures the provision 
of high quality, a key factor in the determi- 
nation of p* is that prices are a signal of 
quality. Should some seller deviate from p* 
by posting p < p*, then the buyers will use 
the observed price to draw inferences about 
this seller’s quality. If they interpret p as a 
signal of quality q,, they will not be at- 
tracted even though p< p*. The opposite 
happens if p is regarded as a signal of 
quality q,,. As in other signaling games, such 
an indeterminacy of out-of-equilibrium be- 
liefs may lead to a multiplicity of equilib- 
rium prices p*. To avoid this problem, I will 
restrict the buyers’ beliefs to satisfy the “in- 
tuitive criterion” proposed by In-Koo Cho 
and David Kreps (1987). Suppose some 
seller wants to undercut his competitors by 
some price p slightly below p* and, at the 
same time, wishes to convince the consumer 
that he offers high quality. Then one may 
reasonably assume that this seller succeeds 
if he would not gain by posting p and 
selecting quality q,, even if his offer attracts 
the entire market. This prerequisite is ful- 
filled if low quality deters the customer from 
paying p for the good, that is, if p > 7(p*). 
Summing up, in the posted-price equilib- 
rium p* only prices p> 7r(p*) are consid- 
ered as a signal of high quality. 


Definition: p* is a posted-price equilibrium 
if (i) q, — p* =v and p* >c(q,,); (ii) p* = 
ar( p*); and (iii) there is no p > 7r(p*) such 
that p< p* and p-— c(q,)>I[p* 
c(q,)I/N. 


The first of these conditions is the same 
as in the definition of the negotiated-price 
equilibrium. Requirement (ii) represents the 
sellers’ incentive-compatibility constraint to 
provide high quality. Condition (iii) pre- 
cludes any of the sellers gaining by unilater- 
ally posting some price p below p* that 
signals high quality. Here we assume that if 
all sellers post the same price p*, each 
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store has the same chance of attracting con- 
sumers so that its market share equals 1/N. 
The equilibrium outcome depends on the 
level of switching costs. Let 


(7) 53 =1-(4n- 97) /v- 


Then, 5, <1 because q, > q,. 


PROPOSITION 2: There is a posted-price 
equilibrium if and only if 5>65,. If 56> 5g, 
the posted-price equilibrium is unique with 
p* = maxic(q,),(q, — — 5). 


PROOF: 

By the first inequality in equilibrium con- 
dition (i), one has (p*) = q, — 5(q,, — p*). 
As p* >c(q,,) and N22, condition (iii) is 
satisfied if and only if p* minimizes p sub- 
ject to p>c(q,) and p>(p*). For 6= 
La, — — only the first con- 
straint is binding and so one has p* = c(q,). 
Otherwise, only the second constraint is 
binding so that p* =7(p*), that is, p* = 
(q, — 5q,)/A— 6). If p* =c(q,), the first 
inequality in (i) is always satisfied. For p* > 
c(q,,) this inequality becomes q,— p* = 
(q,, — 6) = v which, by definition of 
is identical to 6 > 65. 


The posted-price equilibrium may fail to 
exist for low values of 6 if 5, > 0, that is, if 


(8) Qn-¢ <U- 


This condition implies that the consumer 
prefers buying the outside good to purchas- 
ing quality q, at a price p > qy,. It limits the 
use of prices as signals of quality in the 
posted-price market. Even though the buyer 
may reasonably be convinced that prices 
above q, indicate high quality, because he 
would always quit a low-quality store with 
p>gq,, he cannot be attracted by such a 
price offer. More generally, the lock-in 
effect becomes more serious when the dif- 
ference between q, and q, is decreased. 
This is so because the consumer’s utility 
gain from quitting a low-quality seller and 


going to a high-quality seller becomes 
smaller.'4 

Regions I and II of Figure 1 describe the 
area where a posted-price equilibrium ex- 
ists. For values of 6 below 6, the lock-in 
effect becomes too strong and leads to a 
deterioration of product quality. As a con- 
sequence, the institution of posted-offer 
pricing precludes the sellers from being ac- 
tive in the market and results in consump- 
tion of the outside good in regions III 
and IV. 

For 6 close enough to unity, the posted 
price p* equals c(q,,). For lower values of 
6, however, one observes that price exceeds 
marginal costs. The increase in the price 
above the cost of producing high quality 
gives the firms an incentive not to offer low 
quality. This is a familiar feature of search 
models in which prices signal product qual- 
ity (see Asher Wolinsky, 1983; Rogerson, 
1988). A similar mechanism appears also in 
experience good markets with repeat pur- 
chases, in which quality is learned after the 
good is bought. In the Benjamin Klein and 
Keith B. Leffler (1981) model, consumers 
pay a quality premium which prevents the 
seller from cutting quality. Michael Riordan 
(1986) assumes that firms make price and 
quality decisions which cannot be altered 
over the relevant period. A high price then 
signals a high quality since consumers would 
refuse a second purchase of low quality at a 
high price. In my model, the consumer 
learns about quality before buying the good 
and so he never purchases low quality at a 
high price. 

The negotiated-price market is clearly 
more efficient than the posted-price regime 
in region IV of Figure 1. Here the N firms 
remain inactive in the posted-price market, 
whereas negotiated pricing results in pro- 
duction of the high-quality good with posi- 
tive payoffs both for the sellers and the 
buyers. In contrast, the posted-price market 


The same argument shows that the equilibrium 
quality must unravel to qg, when quality is a continuous 
choice variable within some interval [q,,q,,]. To avoid 
total market failure in this situation, one has to assume 
that some consumers are well informed about quality. 
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appears to be superior in region I, where 
the buyer refrains from entering negotia- 
tions because his bargaining power is too 
low. The key insight from Propositions 1 
and 2 is that the two categories of trading 
institutions involve a trade-off. Price bar- 
gaining avoids the moral-hazard problem in 
the firms’ selection of qualities; yet, as the 
price is determined ex post after the buyer 
has chosen the seller, it may not guarantee 
the buyer a sufficient fraction of the surplus 
to make bargaining attractive ex ante. Ex 
ante pricing, as in the posted-price market, 
does not suffer from this drawback; but, 
when the price is fixed ex ante, the lock-in 
effect may have a negative impact on the 
seller’s incentive to produce high quality. 
The relative importance of these considera- 
tions depends on the parameters 6 and a. 
The following section will demonstrate that 
the trade-off between the two pricing insti- 
tutions can explain which will emerge as an 
equilibrium trading rule in a given environ- 
ment. 


IV. The Stability of Competition 


This section is devoted to analyzing which 
pricing mechanism is stable against compe- 
tition. A particular trading rule can survive 
only if no trader can gain by deviating and 
using another trading rule. Applying this 
idea to the negotiated-price market means 
that no seller should be able to profit from 
posting a price ex ante in a situation where 
all the other sellers rely on ex post pricing. 
That is, it must be impossible to attract 
profitably the demand of all consumers by 
undercutting the negotiated-price equilib- 
rium p and posting p < p. The reason why 
such an attempt may fail is that prices be- 
low p may be viewed as an indication of low 
quality. Using the same restrictions on be- 
liefs as in Section III, I will assume that the 
posted offer p convinces the consumer of 
quality q, only if p>7(p), where z(-) is 
defined by (6). 


Definition: The negotiated-price equilib- 
rium p is stable against price competition if 
there is no p < p such that (i) p > 7(p) and 
(ii) p — c(q,) p — cla, N. 
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III IV 


FIGURE 2. NEGOTIATED PRICING Is STABLE IN 
Reaions I] AND IV; Postep PriciNG Is STABLE 
IN ReGIons I ANp II* 


In other words, the institution of negoti- 
ated pricing cannot be eroded by price- 
posting if any offer below / is either viewed 
as a low-quality signal or fails to increase 
the seller’s profit even when he serves the 
whole market. To determine the range of 
parameter values for which this is true, I 
define the function 


(9) 


€(4n))—(4n 


= 
a(dy @) (4, 97) 


Note that for 


one has 

54(a*) = 5, =5-(a*). 
Moreover, 5¢(a)>0 for all a €(a*,1). In 
Figure 2 the function 6 = 6-(a) is depicted 


for a €(a*,1); it divides the former region 
II of Figure 1 into the regions II* and II. 


PROPOSITION 3: The negotiated-price 
equilibrium p is stable against price competi- 
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tion if and only if a= a* and 6 < 6-(a). 


PROOF: 

As p> c(q,,) and N > 2, there is always a 
p <p Satisfying (ii) in the definition of sta- 
bility. As q,, — p= v, condition (i) is identi- 
cal to p> 5(q,— p). Therefore, p is 
stable if and only if there is no p< p satis- 
fying (i). This means one must have p< 
qa, — 5(q,,— p). Using p from Proposition 1, 
this condition is equivalent to 


(10) @)(4n - 


< 


As 6 <1, this inequality cannot hold if the 
left-hand side is negative. Accordingly, by 
the definition of 5-(a@), (10) holds if and 
only if a>{q, — cay) + 4n 
=a and 6< 6-(a). Note that 
5-(a) is strictly increasing for a>a@ and 
that 6,(a*) = 6-(a*). Accordingly, by 
Proposition | there is no negotiated-price 
equilibrium p for a €(@,a*) and 5 < 6a). 
This means that condition (10) applies if 
and only if a>a* and 6 <6(a). 


Proposition 3 states that negotiated pric- 
ing cannot be sustained as a Nash equilib- 
rium in the firms’ choice of pricing policies 
for parameter constellations in regions I 
and II* of Figure 2. Negotiated pricing con- 
stitutes a stable equilibrium only in regions 
II and IV. At first sight it may appear 
paradoxical that for 6<6-(1) the sellers 
will rely on price bargaining when the 
buyer’s bargaining power is rather high. This 
is so, however, because competition forces 
sellers to adopt a trading rule that is advan- 
tageous for the buyer. 

Interestingly, notice that a stable negoti- 
ated price market necessitates a certain 
amount of market frictions. As 5-(1)<1, 
bargaining is not a stable pricing institution 
when 6 is close to unity. Negotiated pricing 
is unlikely to survive in a highly competitive, 
almost frictionless environment. The empir- 
ical implication is that bargaining tends to 
be replaced by posted pricing when im- 


_= 
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provements in the communication and 
transportation technology reduce the buyer’s 
search cost. Switching costs in combination 
with imperfect quality information provide a 
role for negotiated pricing in regions II and 
IV. This may explain why there is relatively 
little haggling in markets where the con- 
sumer is well informed about product qual- 
ity as, for example, in the market for mass- 
produced brand goods. Also, bargaining is 
observed less frequently in the market for 
new automobiles than in the used-car mar- 
ket, where quality uncertainty is more im- 
portant. Other examples for negotiated 
pricing are the markets for antiques and 
real estate, where the buyer has to inspect 
quality before making a purchase. 

To complete the analysis, I now investi- 
gate the stability of posted pricing. I assume 
that posting p* legally commits the seller 
only in the sense that he cannot ask his 
customers to pay more than p*. However, 
this does not constrain the parties not to 
revise jointly the terms of the transaction. If 
in the course of bargaining they both reach 
an agreement, then this replaces the posted 
price. Accordingly, the buyer accepts the 
seller’s posted offer p* only if he does not 
see a chance to pay less after bargaining. 


Definition: The posted-price equilibrium p* 
is stable against bargaining if, given d= 
d( p*), it is the case that 9(q,,d)> p*. 


Given that the buyer cannot induce a price 
reduction by bargaining, he has to pay p* 
after switching to another store. Therefore, 
his threat point in a stable posted-price 
equilibrium is d(p*), as defined by (3). 


PROPOSITION 4: The posted-price equilib- 
rium p* is stable against bargaining if and 
only if either a<a* or a>a* and 65> 
5(a). 


PROOF: 

Using (5), p* is stable if and only if 
p* <(1- — d(p*)] + ac(q,). As 
d( p*) = 8(q,, — p*), this is equivalent to 


(11) p*<[(1-a)(1-8) 
+ ac(q,)]/[1-(1— @)8]. 
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By Proposition 2 this condition always holds 
if 5 — c(q,)I/la, — clq,)] because 
then p* =c(q,). For 5 € —c(q,)]/ 
[q, — c(q,)), one has p* =(q, — 6q,)/ 
(1-6), so that (11) is identical to 


> - 


As 0<6<1, (12) always holds if the left- 
hand side is negative, that is, if 


clan) +n - 97] =a. 


For a> @, (12) is equivalent to 6 > 5-(a). 
By Proposition 2, p* exists if and only if 
5>6,. As 6,>56-(a) for a €(@,a*), any 
p* is stable if a<a*. For a€[a*,1), one 
has 63 <6-(a) <[q, — c(q,)I/Lay 
so that over this range (11) holds if and only 
if 5 > 5-(a). 

Propositions 3 and 4 demonstrate that, 
whenever the WN sellers are active in the 
market, a unique stable pricing institution 
emerges. As the stability criterion elimi- 
nates posted pricing in region II of Figure 2, 
our model predicts that Bertrand competi- 
tion will prevail in regions I and II* and 
negotiated pricing will prevail in regions II 
and IV. Using Propositions 1 and 2, it is 
easily established that p* <p in region II* 
whereas p< p* in region II. The endoge- 
nous determination of trading rules thus 
maximizes the consumer’s equilibrium util- 
ity. In region II* the sellers are trapped in a 
prisoner’s dilemma type of situation. They 
all end up with lower profits because the 
negotiated price p makes undercutting 
profitable. In contrast, in region II the sig- 
naling effect associated with posted pricing 
results in a price level p* that makes bar- 
gaining more efficient for coping with the 
moral-hazard problem. 


V. Conclusion 


I have explored how different pricing 
mechanisms affect the determination of 
quality and price in a market with quality 
uncertainty and switching costs. In sum- 


<} 
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mary, posted pricing involves moral hazard, 
whereas negotiated pricing is not very com- 
petitive. I have shown that in my model this 
trade-off between the two trading institu- 
tions uniquely determines the equilibrium 
pricing policy. The equilibrium trading 
mechanism has the interesting property that 
it ensures the consumer the highest possible 
utility level. 

My model stresses the role of quality 
uncertainty for the determination of 
pricing rules. Of course, this leaves out a 
number of other considerations that may be 
important. For instance, I have assumed 
that bargaining proceeds under symmetric 
information so that negotiations are cost- 
less. Asymmetric-information bargaining 
models can generate costs in the form of 
delay in agreement.'> This may favor posted 
pricing. In general, however, it is not clear a 
priori which is the most efficient pricing 
institution when information and incentive 
problems are involved. 
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Rigging the Lobbying Process: An Application 
of the All-Pay Auction 


By Micuaet R. Baye, DAN Kovenock, AND CASPER G. DE VRIES* 


Why do politicians frequently “announce” 
that they have narrowed down a set of po- 
tential recipients of a “prize” to a slate of 
finalists?! In general, does the slate of fi- 
nalists comprise the “best” candidates, and 
does the best candidate always win?? This 
paper provides answers to these questions. 
Our model of the political process is one of 
rent-seeking, which takes the (perhaps 
overly jaded) view that persons with power 
award political prizes on the basis of self- 
interest. 

In a world where a politician can explic- 
itly auction off a prize to the high bidder, 
the standard auction literature can be used 
to analyze political behavior. The justice 
system, however, precludes politicians from 
explicitly selling the prize to the highest 
bidder; thus politicians cannot let it become 
public knowledge that they are in the busi- 
ness of selling political favors. 

An interesting institution has emerged in 
political markets to overcome this con- 
straint: lobbying. Lobbyists make implicit 
payments to the politician, through cam- 
paign contributions or “wining-and-dining.” 
If these up-front payments were rebated to 
those failing to receive the prize, it would 
be clear that the politician was selling fa- 


*Baye: Department of Economics, Pennsylvania 
State University, University Park, PA 16802; Kovenock: 
Department of Economics, Purdue University, West 
Lafayette, IN 47907; de Vries: Department of Eco- 
nomics, Erasmus Universiteit Rotterdam, P.O. Box 
1738, 3000 DR Rotterdam, The Netherlands. We are 
grateful to Donald Deere, Tom Gresik, Tim Gronberg, 
Bill Nielson, Guogiang Tian, Steve Wiggins, and three 
anonymous referees for suggestions that led to im- 
proved content and exposition. 

'The International Olympic Committee, for in- 
stance, selected six cities as “finalists” for the 1996 
Summer Olympics: Belgrade, Manchester, Toronto, 
Melbourne, Athens, and Atlanta. 

Atlanta won the bid for the 1996 Olympics. 


vors. It is natural, therefore, for a political 
institution to arise such that lobbyists “ante 
up” before the prize is awarded, and these 
up-front payments are not refunded to those 
failing to win the prize. This view of lobby- 
ing has a structure isomorphic to the all-pay 
auction, which differs from standard auc- 
tions in one principal respect: all bids are 
forfeited by the bidders. 

Before we describe our model of the lob- 
bying process, it is useful to provide an 
overview of the existing literature and to 
contrast it with the present analysis. The 
case in which more than two lobbyists value 
the prize identically was first analyzed by 
Hervé Moulin (1986), who characterizes the 
symmetric equilibrium to the all-pay auc- 
tion. Similar analysis is provided by Arye 
Hillman (1988), who argues that the equilib- 
rium is unique. It turns out, however, that 
the symmetric equilibrium is not unique; in 
fact there is a continuum of equilibria (in 
Baye et al. [1990], we provide a full charac- 
terization of the equilibria.) 

The case in which some lobbyists value 
the prize more than others has been ana- 
lyzed by, among others, Hillman and John 
Riley (1989), who argue that equilibrium 
involves only the top two lobbyists. In this 
Review, Tore Ellingsen (1991) has consid- 
ered the interesting case in which one lob- 
byist values the prize more than n —1 com- 
petitors with common valuations (see his 
proposition 1), and he demonstrates the ex- 
istence of n equilibria. Baye et al. (1990) 
have shown, however, that there actually 
exists a continuum of equilibria in this case. 
Moreover, the expected revenue earned by 
the politician differs across this continuum 
of equilibria; there is not revenue equiva- 
lence across the equilibria. 

The present analysis provides a simple 
closed-form expression for expected rev- 
enues that is valid for all equilibria. Our 
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technique does not require an explicit calcu- 
lation of the Nash-equilibrium mixed strate- 
gies. Consequently, our results provide a 
framework with which one may reexamine 
the implications of the equilibria missed for 
previous results, without explicitly calculat- 
ing the (uncountable infinity) of equilibria. 
The objective of the present paper is to 
take into account the continuum of equilib- 
ria and to determine the amount of rents 
the politician can expect to earn given such 
a political institution. To answer this ques- 
tion, we model the political process as a 
two-stage game of complete information. In 
stage 1 the politician takes the political in- 
stitution of lobbying as given but is free to 
constrain the process by “narrowing down” 
the slate of candidates to a set of “‘finalists.” 
In stage 2, the finalists compete in an all-pay 
auction: the lobbyist giving the greatest bribe 
wins the prize, while the others receive 
nothing for their payments. We solve this 
decision problem by backwards induction, 
solving first for the Nash-equilibrium pay- 
ments that accrue to the politician in the 
second-stage lobbying game given an arbi- 
trary set of lobbyists. We then solve for the 
optimal first-stage decision of the politician, 


which involves the selection of the set of 


“finalists” that maximizes expected political 
rents. We will show that, under plausible 
circumstances, the politician has a perverse 
incentive to preclude lobbyists most valuing 
the prize from participating in the second- 
stage lobbying game. Intuitively, this pre- 
commitment may take the form of announc- 
ing prior to any lobbying that “five states 


have been selected as finalists for the site of 


a new military base.” We will refer to this 
precommitment as the exclusion principle. 
The exclusion principle has obvious implica- 
tions for efficiency; states deriving the great- 
est economic benefit from a military base 
(and hence having the highest valuation of 
the prize) may be excluded a priori from the 
announced set of finalists. 


I. The Model 
Consider a politician who must determine 


which of n > 2 lobbyists will receive a prize. 
The value of the prize to lobbyist i is v; > 0, 
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where the v,’s are common knowledge and 
ordered such that v 20, 

The politician does not care which lobby- 
ists wins the prize but does care about how 
much money he has available in his cam- 
paign chest. Accordingly, he decides to 
award the prize to the lobbyist who gives 
him the greatest up-front, nonrefundable 
implicit bribe. The objective of the politi- 
cian is to select a set of lobbyists (the set of 
finalists) that maximizes his expected rents, 
W = EX"_,b;, where b; is the bribe paid by 
player 7. 

Given a set of “finalists,” lobbying is an 
all-pay auction: the payoff to lobbyist 7 if he 
offers a bribe of b, is 7; = v; — b; if b; is the 
highest of all n bribes. However, if some 
other lobbyist offers a higher bribe, lobbyist 
i’s payoff is 7, = — b;. We assume that when 
multiple lobbyists submit the highest bribe, 
the prize is awarded to one of them at 
random. Thus, the payoff of lobbyist i is 
given by 


if b> b, WiFi 


i if i ties M — 1 others 
for high bid 


— 5; if b; < b; for some 
J#il. 


This payoff structure is standard in the lob- 
bying literature (cf. Hillman, 1988). It can 
also be viewed as the limiting case of an 
alternative payoff structure suggested by 
Gordon Tullock (1980) that is also used in 
this literature (see Baye et al., 1989). In 
Section II we characterize the expected 
payments by lobbyists in the second-stage 
lobbying game. These results are used in 
Section III to determine the politician’s 
rent-maximizing selection of the set of 
“finalists.” 


II. The Lobbying Game 


We first sketch a proof of the nonexis- 
tence of a pure-strategy Nash equilibrium 
for the all-pay auction. Consider the two- 
player case and suppose (b,,b,) did com- 
prise a pure-strategy Nash equilibrium 


| 
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(without loss of generality, suppose b, > b,). 
If b, <v,, player 2 could deviate to earn a 
higher payoff by increasing b, slightly above 
b, to win the prize. If b,>v,, player 2’s 
best reply to b, is zero; but with b, =0, it 
pays player 1 to deviate from b, by lowering 
the bid to (small) «> 0, contradicting the 
hypothesis that b, > v>. 

It is known, though, that there does exist 
an equilibrium in mixed strategies, in which 
lobbyists randomize their bribes (cf. Partha 
Dasgupta and Eric Maskin, 1986; Moulin, 
1986; Hillman and Riley, 1989; Baye et al., 
1990). Moreover, with more than two play- 
ers, there generally exists a continuum of 
possible equilibria (Baye et al., 1990). As 
our focus centers around the politician’s 
rent-maximizing selection of finalists, we 
need only characterize the expected total 
bribes that accrue in a given Nash equilib- 
rium of the lobbying game. The innovation 
is that the techniques employed below do 
not rely on the algebraic form of the mixed 
strategies used by the lobbyists in equilib- 
rium and, thus, are valid even in the pres- 
ence of a continuum of Nash equilibrium 
mixed strategies. 

The following theorem is the key ingredi- 
ent that enables us to determine the set of 
finalists that maximizes the politician’s rents. 
The novelty of the result is that it is valid 
for each equilibrium in the continuum of 
possible Nash equilibria and thus can be 
used for purposes beyond the present pa- 
per. For example, the formula allows one to 
strengthen the results of Ellingsen (1991), 
which are based on a finite subset of equi- 
libria. 


THEOREM 1: Let de- 
note the valuations of lobbyists {1,2,...,n} in 
the stage-2 lobbying game. Let E,b, denote 
the expected bid of a lobbyist with the highest 
valuation. Then in any Nash equilibrium, 


Let F(b;) denote the cumulative distribu- 
tion function of lobbyist i in an arbitrary 
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(mixed-strategy) Nash equilibrium, and let 
S, denote the support of the distribution. 
Lobbyist i must earn constant (expected) 
profits almost everywhere (a.e.) in S;. For 
lobbyist 1 this constant must equal v, — v3, 
and for lobbyists 2,3,...,”, this constant is 
zero (see Baye et al., 1990).> Hence, the 
following conditions must hold: 


a.e.on S, 
and 


j#i 


[—b,]=0 


j#i 
a.e.on §;,i#1. 


Let p,(b;)=11},,F,(b;) denote the probabil- 
ity that lobbyist i wins the prize, conditional 
on his bid and the strategies employed by 
the other n —1 lobbyists in a Nash equilib- 
rium.* Then, since equations (3) and (4) 
hold almost everywhere in their respective 
supports, taking the expectations of these 
equations and manipulating reveals that 


(5) Pw,— E,b,=0,- v2 
and 


(6) Pv.—E,b,=0 Wi¥1 


where E, denotes the expectation with re- 
spect to lobbyist j’s (equilibrium) mixed 


*Note that when lobbyist 2’s valuation equals that of 
lobbyist 1, — v2 =0. 

“We can rule out mass points for any agent at a bid 
b > 0 (see Baye et al., 1990). 


V> 
(2) W=—v,+ < Up. 
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strategy and P;= E;p,b;). Summing over 
equations (5) and (6), we then obtain 


(7) We E,b, 


j=l 


=(P,-1)v,+ P.v; + 


Applying the fundamental theorem of inte- 
gral calculus to L7_,P,, it follows that 
j-1P,=1. Furthermore, if v,>v;, i> 2, 
then P,=0 (see Baye et al., 1989, 1990). 
Hence, 


(8) W=(P,-1)v,+(2— 
Rearranging (5), we find 


E,b, 

which, inserted into (8), yields our results. 


Two implications of Theorem 1 are worth 
noting. First, if two or more players most 
value the prize at some common level, v, 
the expected rents accruing to the politician 
equal v; there is full rent dissipation. Sec- 
ondly, if v,;>v,, then the expected rents 
accruing to the politician are strictly Jess 
than v,, since E,b, <v, in any Nash equi- 
librium. In other words, regardless of 
whether there is a unique equilibrium® or a 
continuum of equilibria,° in every equilib- 
rium there is underdissipation of rents. In 
the following section, this result will be used 
to establish when it pays a politician to 
preclude some lobbyists from competing in 
the lobbying game. First, however, we state 
the following lemma from Hillman and Ri- 
ley (1989). 


LEMMA 1: Suppose that the valuations of 
the lobbyists in the stage-2 lobbying game are 
such that >U,. Then in 
the unique Nash equilibrium, Eb, = v, /2. 


~The equilibrium is unique when v, > v3. 
There is a continuum of equilibria when v5 = v3. 
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Theorem 1 and Lemma 1 together imply 
that, when two players value the prize 
strictly more than all other players, the ex- 
pected rents accruing to the politician in the 
stage-2 lobbying game are 


(9) W(v,,U2) = 


Note that, when v, > v, > v3, expected rents 
are increasing in v, but decreasing in vj. 
Intuitively, as player 1’s valuation increases, 
the playing field becomes more unequal. 
Hence, player 2 reduces his expected pay- 
ment to the politician, and total expected 
rents decline. 

It is important to note that the formula in 
equation (9) is based on specific configura- 
tions of valuations and does not hold in 
general (it does not hold when v, > v, = v3). 
The reason is that, when v, >v,=U3, Eb, 
in equation (2) varies depending upon which 
of the continuum of equilibria is played, 
and thus, the politician’s expected rents de- 
pend upon which equilibrium the lobbyists 
play. This point has not been addressed in 
the existing literature, and it plays a crucial 
role in our analysis. 


III. Selecting the Finalists 


Since there exists a continuum of ex- 
pected political rents for some config- 
urations of valuations, our next task is to 
characterize properties of the maximum ex- 
pected political rents that can be extracted 
from the lobbyists. 


PROPOSITION 1: /f {i,...,/} is a rent- 


maximizing set of finalists (with valuations 
=0,,), then expected rents are 


é 
(10) 


PROOF: 

We must show that if {1,..., 7} is a set of 
finalists that maximizes expected rents (and 
the corresponding valuations are 0, > --- > 


v,| 2 
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b,,), then expected rents are W(i,,0,). This 
is clearly true if m= 2; hence, suppose m > 
2. If 6,=6,=0, equation (2) reveals that 
W=t=W(0,,0,). If 0,> 02> 0,3, equation 
(9) shows again that W = W(0,,0,). Finaily, 
if 0, >6,=6,=6, expected rents increase 
by excluding player 1, since by Theorem 1 
W(é,,0,) < 6, =W(6,,é,). However, this 
contradicts the hypothesis that the set 
{1,...,} maximizes expected rents. Hence, 
we conclude that any rent-maximizing set of 
finalists generates expected rents of 
W(G,, 65). 


Thus, while equation (9) does not hold 
for all possible configurations of values, it 
does hold when the set of finalists is se- 
lected so as to maximize expected rents 
[equation (10)]. This result allows us to de- 
termine the set of finalists that maximizes 
the politician’s expected rents. Specifically, 
since equation (10) is decreasing in the 
highest valuation and increasing in the 
second-highest valuation, it never pays to 
exclude a player with a valuation that lies 
between the valuations of any two lobbyists 
who are in the set of finalists. Thus, the 
expected rent-maximizing set of finalists is 
determined by considering all pairwise com- 
binations of adjacent lobbyists until lobby- 
ists kK and k +1 are found such that 


W( = max W(0;,0;41)- 


To realize these rents, the politician must 
exclude players with valuations greater than 
v, from the set of finalists. Formally, we 
have shown the following: 


PROPOSITION 2: Suppose v,>v,>v,= 

* >v,. Then the politician maximizes ex- 
pected rents by constructing a set of finalists 
that excludes lobbyists with valuations strictly 
greater than v,, where k is such that 


U, 
[1+ 
Uy 2 


In order to highlight the implications of 
our results, consider the following two 
corollaries. 
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COROLLARY 1: Suppose v,;=v,>03°°° 
>v,. Then the politician dees not gain by 
constructing an agenda that excludes some 
lobbyists from the lobbying game. 


COROLLARY 2: Suppose v,>v,=v3= 
- >v,. Then the politician maximizes ex- 

pected rents by excluding the lobbyist with the 

highest valuation from the set of finalists. 


It may also be optimal for a politician to 
exclude more than one lobbyist from the 
stage-2 game. For instance, suppose v,> 
Then the politician 
maximizes expected rents by constructing an 
agenda that excludes lobbyists 1 and 2 from 
the set of finalists whenever 


These results demonstrate the exclusion 
principle: a politician may benefit from pre- 
cluding the lobbyists valuing the prize the 
most from participating in the lobbying pro- 
cess. 

We conclude with a numerical example 
to aid in elucidating our findings. Suppose 
v, = 50, v, = 40, and v;= 38. The theorem 
and lemma imply that che politician earns 
W = 36 if he does not constrain the lobbying 
process or limits lobbying to only players 1 
and 2. However, if the politician announces 
that players 2 and 3 are the finalists, then 
the expected payments to the politician are 
W(40,38) = 37.05. Thus it pays the politician 
to exclude lobbyist 1, who values most the 
prize, from participating in the lobbying 
game. 


IV. Conclusions 


This paper has examined an interesting 
principle arising in all-pay auctions: the ex- 
clusion principle. This principle states that a 
politician wishing to maximize political rents 
may find it in his best interest to exclude 
certain lobbyists from participating in the 
lobbying process—particularly lobbyists 
valuing most the political prize. In addition 
to pointing out the exclusion principle, our 


Uz \ V2 
(1+ 
2 
\ 1 
v \v 
i+] 
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Theorem 1 characterizes expected revenue 
for the entire continuum of equilibria that 
can arise in n> 2-player all-pay auctions 
with arbitrary valuations of the prize. This is 
in contrast to the results of Ellingsen (1991), 
Hillman (1988), Hillman and Riley (1989), 
and Hillman and Dov Samet (1987), among 
others, which are valid only for a subset of 
possible equilibria. 


REFERENCES 


Baye, Michael R., Kovenock, Dan and de Vries, 
Casper G., “The Economics of All-Pay, 
Winner-Take-All Contests,” Texas A&M 
University Working Paper No. 89-21, 
1989. 


: and , “The All-Pay 
Auction with Complete Information,” 


CentER Working Paper No. 9051, Tilburg 
University, 1990. 

Dasgupta, Partha and Maskin, Eric, “The Exis- 
tence of Equilibrium in Discontinuous 


MARCH 1993 


Economic Games, I: Theory,” Review of 
Economic Studies, January 1986, 53, 1-26. 
Ellingsen, Tore, “Strategic Buyers and the 
Social Cost of Monopoly,” American Eco- 
nomic Review, June 1991, 81, 648-57. 

Hillman, Arye L., The Political Economy of 
Protectionism, New York: Harwood, 1988. 

_____ and Riley, John, “Politically Con- 
testable Rents and Transfers,” Fco- 
nomics and Politics, Spring 1989, 1,17 39. 

and Samet, Dov, “Dissipation of Uon- 
testable Rents by Small Numbers of Con- 
tenders,” Public Choice, 1987, 54 (1), 
63-82. 

Moulin, Hervé, Game Theory for the Social 
Sciences, 2nd Ed., New York: New York 
University Press, 1986. 

Tullock, Gordon, “Efficient Rent Seeking,” in 
J. M. Buchanan, R. D. Tollison, and G. 
Tullock, eds., Toward a Theory of the 
Rent Seeking Society, College Station: 
Texas A&M University Press, 1980, pp. 
97-112. 


Monopolistic Competition and Optimum Product Diversity: 
Comment 


By XIAOKAI YANG AND BEN J. HEWUDRA* 


The model of Avinash Dixit and Joseph 
E. Stiglitz (1977) (DS hereafter) has re- 
ceived more and more attention from 
economists in the past decade and can now 
be regarded as the central model of 
monopolistic competition. Applications of 
the model can be found in such diverse 
fields as international trade, growth theory, 
and macroeconomics. ! 

The purpose of this short note is twofold. 
First, we suggest an alternative solution 
method to that proposed by DS and used in 
most applications of their model. Our 
method makes full use of the symmetry 
assumption employed by DS but extends 
their method, so that the resultant pricing 
rule takes into account the price-index ef- 
fect. Our method retains the intuitive ap- 
peal and simplicity of the original DS model. 
Second, we show that our solution method 
extends the range of applicability of the DS 
model significantly; more general produc- 
tion structures can now be incorporated in 
the model. 

The note is organized as follows. In Sec- 
tion I we set up the DS model and develop 
our solution method. We also discuss the 
impact that the alternative solution method 
has on the qualitative results obtained in 


* Yang: Reader, Department of Economics, Monash 
University, Clayton, Victoria, Australia 3168; Heijdra: 
Research Fellow, OCFEB, Erasmus University, Post- 
bus 1738, 3000 DR Rotterdam, The Netherlands. We 
thank the referees as well as the following people for 
their helpful comments: Yew-Kwang Ng, Peter Dixon, 
Avinash Dixit, and Gene Grossman. 

In the international-trade literature the seminal 
papers are Paul R. Krugman (1980), Wilfred J. Ethier 
(1982), and Gene M. Grossimaii and Cihanan Helpman 
(1989, 1990). A recent growth model employing the DS 
insights is Paul M. Romer (1990). The main macroeco- 
nomic applications are Blanchard and Kiyotaki (1987) 
and Julio J. Rotemberg (1987). 


the DS study and the early application by 
Paul R. Krugman (1980). Section II broad- 
ens the scope of the discussion somewhat by 
discussing existence problems associated 
with the DS approach under increasing re- 
turns and isoelastic cost functions. It is 
shown that if our solution method is used 
then the existence problems disappear, and 
the DS model yields well-defined answers. 
Finally, in Section II] some concluding re- 
marks are given. 


I. The Preference-for-Diversity Model 


Consider the following version of the DS 
model.* Consumers maximize utility subject 
to a budget constraint. The utility function 
is separable in the numeraire commodity x, 
and the other commodities (x;, i=1,...,n). 
The prices of the commodities are denoted 
by and p,, respectively. Income is 
equal to the endowment income (J) plus 
distributed profits. Since profit is zero in 
equilibrium, income equals J. The maxi- 
mization problem is: 


(1) max Xo, 


Ky Xj 


subject to 


The price and quantity indexes q and y are 


“We follow the notation introduced by DS with the 
exceptions that we do not set endowment income / 
equal to unity and that we use o to denote the substi- 
tution elasticity between the differentiated commodi- 
ties. The example discussed in the text refers to the 
constant-elasticity case of section I of DS. This is also 
the case discussed by Jean Tirole (1988 pp. 298-300). 


n l/p 
| 
i=1 
n 

T= 

295 
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defined as follows: 


1/U-o) 


(2) q= 


Here o =1/(1— p) is the elasticity of sub- 
stitution between the differentiated prod- 
ucts. Since 0< p<1, it follows that >1. 
Part a of Proposition 1 summarizes the so- 
lutions of this maximization program.’ De- 
mands are uniquely determined by the share 
function, s(q), the form of which depends 
on U. 


PROPOSITION 1: (a) The solutions for x, 
and x; are 


_ ( 
@) q 


(b) The own elasticity of demand faced by 
firm i is denoted by = (0x; /Ap;X p; /x;)I, 
and the cross elasticity of demand for firm i is 
denoted by | =(0x; /0p;X p; /x;)]. In the 
symmetric equilibrium with p; = p; Wi, j, these 
elasticities are as follows: 


(4) ———+ 
n 


Here 0(q) =(ds /0qXq/s) is the elasticity 
of the share function. 


Equation (3) is the DS solution. The elas- 
ticity of demand (¢;) as given in (4), how- 
ever, differs from that calculated by DS, 
who find ¢; = — 0. The DS approach makes 


>Proofs of propositions have been omitted for rea- 
sons of space. They are available from the authors 
upon request. 
For example, if U is Cobb-Douglas with parameter 
1—y for the numeraire good, the s(q)= y. 
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two assumptions to approximate a solution: 


ASSUMPTION 1: Each producer i ignores 
the cross-price elasticity of demand for a vari- 
ety of goods (i.e., €;,=0). 


ASSUMPTION 2: The influence of an indi- 
vidual price change on the general price index 
is ignored (i.e., dq /dp; = 0). 


Assumption 1 is ad hoc and is accepted 
here since no alternative is available that is 
both tractable and theoretically superior.” 
However, as we argue below, in any appli- 
cation of this model one must ascertain 
ex post that e,, is indeed close to zero in 
equilibrium. 

Assumption 2 is not innocuous. DS moti- 
vate this assumption on the grounds that 
the number of goods is extremely large. 
This claim may be criticized on the follow- 
ing grounds. First, the number of goods in 
the DS mode! is endogenously determined 
and is a function of technology and endow- 
ment parameters. Hence, we need some 
qualifications on these parameters to ensure 
that the number of goods is indeed large in 
equilibrium. We cannot simply assume that 
this condition is satisfied. Second, as we 
demonstrate below, the effects on the 
model’s solution of ignoring the price index 
in the calculation of the demand elasticity 
may not be trivial even if the number of 
goods is large, depending on the relative 
values of some key parameters. 

Assumptions 1 and 2 are in a sense both 
ad hoc, but there is nevertheless virtue in 
relaxing Assumption 2 and not Assumption 
1. In the setting adopted here, and from a 


One admittedly ad hoc way to introduce perceived 
price reaction is to postulate dp; /dp; = € for all i+ j, 
where 0 < é <1. Then, the price elasticity in symmetric 
equilibrium is equal to 


&(n-1)](o -14+ 0) /n. 


This provides an additional rationale for a term involv- 
ing n in the elasticity formula. See Tirole (1988 pp. 
244-5) for a critique of this approach. 


n 
i=1 
n 
i=1 
3 
8; Te. 
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purely technical point of view, Assumptions 
1 and 2 are related: =0 —o. If we 
adopt Assumption 1, then Assumption 2 is 
implied and vice versa. This equivalence 
does not validate forcing this result by as- 
suming that 7 is large; a large n must be 
the outcome, not an assumption of the anal- 
ysis. From an economic point of view, As- 
sumptions 1 and 2 are furthermore quite 
distinct: Assumption 1 relates to behavior 
external to the firm, while Assumption 2 is 
internal to the firm. If we abandon Assump- 
tion 1 we leave the world of monopolistic 
competition. This is not the case with As- 
sumption 2. 

A further issue cuts deeper at the heart 
of the DS approach. A key assumption of 
that model is that the elasticity of substitu- 
tion o is constant; no matter how many 
varieties are available, the substitution elas- 
ticity is constant. A more reasonable as- 
sumption may be that o rises with the num- 
ber of available varieties: =o(n) with 
a’ > 0. Clearly, in such a case ¢;,; will only 
tend to zero as n—~ if o(n) rises at a 
slower rate than n. Otherwise, a large n 
may still be associated with a nontrivial cross 
elasticity. 

The elasticity of demand as given in (4) 
incorporates the price-index effect (in sym- 
metric equilibrium) without significantly 
complicating the solution of the model. We 
now investigate whether our solution 
method affects any of the qualitative conclu- 
sions reached by DS. Several equilibria can 
be distinguished; (i) the market equilibrium 
(ME) is the case of decentralized Chamber- 
linian monopolistic competition, (ii) the 
constrained social optimum (CSO) is defined 
as the optimum that would be obtained by a 
benevolent social planner in the absence of 
lump-sum subsidies, and (iii) the uncon- 
strained social optimum (USO) is the opti- 
mum when lump-sum subsidies are allowed. 

ME occurs when the following two condi- 
tions are satisfied by all active firms /: 


lia) 
l+e; 


/ 


(p;—¢)x;=a. 
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The first part of (5) implies equality of 
marginal revenue and marginal cost (MR = 
MC) for each active producer, where c is 
marginal cost. The second part of (5) en- 
sures that the marginal firm breaks even, 
where a is fixed cost. In symmetric equilib- 
rium all firms charge the same price and 
produce the same amounts (i.e., p; = p; = p 
and x,=x,=x). 

Explicit solutions for p, x, and n cannot 
be obtained for the general case defined 
here. This is irrespective of the solution 
method used. DS characterize the solution 
qualitatively under the assumption that e; is 
constant |i.e., by ignoring the terms involv- 
ing n from the expression for e; in (4)]. 
They show that the equilibrium is unique, 
provided the following elasticity condition is 
satisfied: 0(q)>1-—o (DS, 1977 p. 300). In 
principle, this condition could be general- 
ized to take into account the expanded ex- 
pression for e; in (5). We have chosen, 
instead, to base our comparison of the solu- 
tion methods on a special case, where U in 
(1) is Cobb-Douglas. In that case, s(q)= y, 
and @(qg)=0. Our rationale for concentrat- 
ing on this special case is twofold. First, the 
major applications of the DS model have 
been based on this special case (see e.g., 
Krugman, 1980; Blanchard and Kiyotaki, 
1987). Second, simple and intuitive explicit 
results are possible using this assumption.® 

Part a of Proposition 2 summarizes the 
ME solutions for the equilibrium price, out- 
put, and the number of products. For pur- 
poses of comparison, parts b and c of 
Proposition 2 restate the results of DS. 


PROPOSITION 2: Assume that U is 
Cobb-Douglas. Then s(q)= y, 0(q)= 0. 


Assuming a constant-elasticity-of-substitution (CES) 
formulation for U still does not render 0(q) constant 
and hence does not lead to explicit solutions. See 
equation 6 in DS (p. 298) for the relationship among 
the substitution elasticity, the share function, and its 
elasticity 
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(a) The market equilibrium (ME) solutions 
for n, x; = x, and p; = p are: 


I 
(6) | 


p\yl-a 


p(yl—a) 


7 
+6 


n, = —(1—p)+ p. 
a 


(b) The constrained social optimum (CSO) 
solutions for n, x; = x, and p; = p are: 


c 
(7) 
p 
ap 
yl 
a, = —(1-p). 
a 


(c) The unconstrained social optimum (USO) 
solutions for n, x; = x, and p; = p are: 


(8) 
ap 
* (1-9) 
yI(1— p) 
alp+y(1-p)] 


(d) The degree of diversification in ME, CSO, 
and USO may be compared as follows: 


ny 


p€[0,p) =n, >n, 


p=p>n =n 


e 


p€(p,1] =n, <n, 
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where 
p=—p(a,y;1) ap /aa<0 
and 
ap /ay <0 
p(a,1;1)=0. 


(e) In ME the own and cross price elasticities 
are equal to the following: 


p 


a 


DS themselves find x’.=ap/c(— p), 
c/p, and n’,= p)/a. Comparing 
these values to the ones in (6) reveals that 
DS underestimate the number of firms (al- 
though by less than 1, since 0 < p< 1), un- 
derestimate the price, and overestimate the 
quantity. The difference in price disappears, 
however, if one assumes (as do DS) that rn’, 
represents a large number. Of course, in 
view of the definition for n’, this will only be 
large if fixed costs (a) are low relative to the 
share of endowment income spent on the 
differentiated commodities (y/) and if the 
goods are not close to perfect substitutes 
(p <1). If the value of p is close to 1 then 
it all depends on the relative magnitude of 
(1— p)/a even if y/ is large. 

A more subtle consequence of our correc- 
tion to the DS solution method appears 
when we compare the ME solutions to those 
corresponding with the constrained and un- 
constrained social optima as defined by DS 
(1977 pp. 300-2). Part b of Proposition 2 
summarizes the CSO solutions for the equi- 
librium price, output, and the number of 
products. Comparing the CSO solutions to 
the ME solutions reveals that the market 
equilibrium does indeed lead to “a little too 


| 
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much” diversification (i.e., n, >n,). By us- 
ing our solution method one finds that the 
conclusion reached by DS (1977 p. 301) 
regarding the equivalence of ME and CSO 
no longer holds. 

Part c of Proposition 2 summarizes the 
USO solutions for the equilibrium price, 
output, and the number of products. As in 
DS, price equals marginal cost, and the 
consumers pay the lump-sum subsidies to 
the producers to render profits equal to 
zero. 

Part d of Proposition 2 summarizes the 
results regarding the degree of diversifica- 
tion under the different scenarios. As in DS 
(1977 p. 302) USO yields a greater degree 
of diversification than the CSO (i.e., n, > 
n.). However, unlike DS, no general conclu- 
sion can be drawn regarding the difference 
in the degree of diversification between 
USO and ME. As is reported in (9) there 
exists a critical value for p, denoted by p, 
which determines the difference between n, 
and n,,. Furthermore, p depends negatively 
on both fixed costs (a) and the share param- 
eter (y). 

An important early application of the DS 
model to international trade issues is Krug- 
man (1990). Application of our solution 
method reverses some of the conclusions 
drawn by Krugman (1980), thereby resolving 
an anomaly in that model. The Krugman 
model is a special case of the DS model 
with s(q) = y = 1. The solution of this model 
can be obtained by applying the results in 
Proposition 2. In Krugman’s model, J is 
interpreted as the size of the economy. The 
emergence of international trade leads to 
an increase in the size of J, which in turn 
influences the price level, output, and the 
degree of diversification. Using the results 
from part a of Proposition 2, it is straight- 
forward to verify the following comparative- 
static results: 


Op. Ox. én, 
> 
ol al al 


Krugman (1980) himself uses the DS ap- 
proach to solve his model. He makes the 
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sharp prediction that the price and output 
are independent of the size of the economy; 
that is, an opportunity for international 
trade has no effect on the wage price of 
goods or on the output level of each good. 
In contrast, x, and p, as obtained from 
part a of Proposition 2 do depend on the 
size of the economy with sign given in (11), 
and therefore on the opening up of interna- 
tional trade. 

Krugman (1980 p. 953) expresses concern 
about the counterintuitive feature of his so- 
lution but claims that we need a more com- 
plicated model to sort out the effects of 
international trade on prices and output 
levels of each good. Using our solution 
method, no further complications to the 
model are needed in order to obtain plausi- 
ble results. Of course, if J and n are large, 
as Krugman assumes, then the com- 
parative-static predictions derived from 
Krugman’s solution are good proxies for the 
correct solutions given by our approach. 


II. Increasing Returns to Scale and 
Isoelastic Cost Functions 


In the original DS model, increasing re- 
turns to scale exist because of the existence 
of fixed cost. We now adapt the model in 
the DS paper and specify the homogeneous 
production functions for x; and Xx, as fol- 
lows: 


Here /, denotes the quantity of labor em- 
ployed to produce good i (i=0,1,...,n), 
and 8 <1 characterizes globally increasing 
returns to scale in producing good i. Each 
consumer is endowed with one unit of la- 
bor, and the size of the labor force is L. 
Labor is assumed to be the numeraire and 
the nominal wage rate is given as w=1. 
Proposition 3 summarizes the results using 
our approach. 


PROPOSITION 3: (a) The market equilib- 
rium (ME) exists if p> B. The solutions for 
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X; =X, D; = DP, Xo, Po» and n are: 


(13) 
yL(p — B) 
(14) 
(15) + _ 
p—B 


(1— p) 


(b) The market equilibrium does not exist if 
p<B. 


In contrast, the solution obtained by using 
the DS approach does not exist for this 
model except if p = B, but at that point the 
number of firms (and the degree of diversi- 
fication) is indeterminate. Although p = B is 
not impossible, the DS solution method is 
incorrect in this case. Part a of Proposition 
3 shows that a well-defined market equilib- 
rium does exist for values of p in excess of 
B. The DS method finds an equilibrium 
even though the correct equilibrium does 
not exist, if B = p. 

The intuition behind the “knife-edge”’ re- 
sult is quite straightforward. The ratio of 
marginal cost (MC) to average cost (AC) is 
constant (i.e. MC=£8AC). For the DS 
method we must simultaneously satisfy MC 
= MR, P=AC, and MR = pP, where MR 
represents marginal revenue and P is the 
price. This can only be the case if B = p. At 
that point there is nothing determining n. 
In our solution method this knife-edge 
problem disappears because the elasticity of 
demand depends on n, and the solution 
satisfies MC = MR, and P = AC as before, 
but MR =[p(n —-1)/(n — p)]P. This can be 
solved for meaningful solutions if B < p.’ 


’This result is not specific to the simple production 
structure used in (12). In fact, the result holds for any 
multi-input homogeneous production function pro- 
vided all factors are variable, so that the cost function 
is isoelastic. 
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The knife-edge problem associated with 
the DS approach is not without practical 
relevance. Indeed, of the four production 
functions most commonly used by econo- 
mists (i.e., Cobb-Douglas, constant elasticity 
of substitution, Leontief, and linear), only 
the linear production function is compatible 
with the DS approach, provided fixed costs 
exist.® 


III. Concluding Remarks 


We have shown that a simple alternative 
solution method exists for the DS model— 
one which makes full use of the structure of 
that model. We have traced through the 
implications of this new solution method on 
the qualitative conclusions derived from the 
DS model itself and various models derived 
from it. The implications of our solution 
method in some cases reverse the conclu- 
sions derived on the basis of the DS solu- 
tion method. 

The DS solution method is valid if the 
number of commodities (the degree of di- 
versification) is large: the validity of the 
assumption of monopolistic competition de- 
pends on a large number of goods. The 
number of goods, however, is endogenously 
determined within the model and depends 
on specifications of technology and endow- 


“Ethier (1982) applies the ideas of DS on the pro- 
duction side of his model. Since he employs the DS 
solution method, this model cannot generate a well- 
defined equilibrium if the production function for pro- 
ducer goods is isoelastic, while our approach can. 

Our approach also has relatively minor conse- 
quences for the recent dynamic model of Grossman 
and Helpman (1989). Specifically, it can be shown, 
using our approach, that the 7 = 0 line is nonlinear and 
downward-sloping and not horizontal as Grossman and 
Helpman (1989 p. 1268) suggest. This modification is of 
relatively minor importance for their linearized 
steady-state analysis. It may, however, gain importance 
in a full nonlinear dynamical analysis of their model. 

The knife-edge result (plaguing the DS and Ethier 
models) is not a problem in Grossman and Helpman’s 
(1989) model because positive profits exist due to the 
predetermined nature of n, the number of product 
varieties. Hence, the constraint P = AC is not opera- 
tive. 
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ments. Therefore, one cannot tell a priori 
whether the number of goods is large, 
whether the cross price elasticity is trivial, 
and whether the DS approach gives accept- 
able approximate solutions unless one knows 
the explicit specification of the model and 
has solved the model using our approach. 

Our solution method widens significantly 
the applicability of the DS model. Specifi- 
cally, whereas the DS solution approach 
cannot accommodate isoelastic (decreasing) 
cost functions, our method can. Hence, 
Cobb-Douglas, CES (constant elasticity of 
substitution), and Leontief production tech- 
nologies are not problematic in the DS 
model, provided our solution method is em- 
ployed. 

A final argument in favor of our approach 
is the fact that no additional complexity is 
introduced. The adjustment to the pricing 
rule is simple and intuitive, and explicit 
solutions for the endogenous variables in 
the model can still be calculated easily. On 
the basis of these arguments we suggest 
replacing the DS solution method with ours. 
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Monopolistic Competition and Optimum Product 
Diversity: Reply 


By Avinasu K. Drxit AND JosepuH E. STIGLitz* 


Our model of monopolistic competition 
and product diversity (Dixit and Stiglitz, 
1977) was intended for the large-group case. 
Therefore we used an approximation ne- 
glecting from the own and cross elasticities 
terms of order 1/n, where n is the equilib- 
rium number of firms. Xiaokai Yang and 
Ben J. Heijdra (1993) have generalized the 
solution for a special case of our model, 
retaining some of these terms but assuming 
a unitary elasticity of substitution between 
the monopolistic group and the numeraire 
good. We will discuss their model by placing 
it in an even more general framework. 

The underlying monopolistically competi- 
tive equilibrium of a symmetric group is 
governed by two conditions: each firm’s 
maximization equates marginal revenue and 
marginal cost, and free entry equates aver- 
age revenue and average cost. Let n denote 
the number of products or firms in the 
group, x the output of each such firm, and 
p the price of each product in the group 
relative to the competitive numeraire. Let 
C(x) be each firm’s cost of production, and 
let e(p,n) be the elasticity of demand per- 
ceived by each firm. Then the two condi- 
tions can be written 


(1) =C'(x) 


e(p,n) 
(2) p=C(x)/x. 


Further, let the demand relation in equilib- 
rium be 


(3) p=D(x,n). 
*Department of Economics, Princeton University, 


Princeton, NJ 08544, and Department of Economics, 
Stanford University, Stanford, CA 94305, respectively. 


These three equations determine p, x, and 
n. Unless one wants a closed-form solution, 
the various functions, including the elastic- 
ity e(p,n), can have any functional form 
(restricted only by minor requirements for 
existence and the relevant second-order 
conditions). This formulation subsumes not 
only Yang and Heijdra’s model, but also 
other widely different models including the 
one based on Hotelling-like spatial product 
differentiation. Indeed, Elhanan Helpman 
and Paul Krugman (1985 Chapters 6-8 
[especially section 7.1]) develop the 
monopolistic competition model for their 
international trade applications in just such 
a general way. 

Our original (1977) model started with a 
CES (constant elasticity of substitution) util- 
ity function and then employed an approxi- 
mation ignoring some terms of order 1/n to 
make « constant. Yang and Heijdra dis- 
pense with this approximation, using in- 
stead 


o 
(4) e(p,n)=a- solid. 


n 


where o, required to be greater than 1, is 
the elasticity of substitution between any 
pair of goods within the monopolistically 
competitive group, p =(a — 1)/c. Finally, 0 
is the share of the consumer’s expenditure 
on the monopolistically competitive group 
and is a function of the group price index 
(n'/?p). Since Yang and Heijdra insist on a 
closed-form solution, they have to assume 
that @ is constant, which amounts to requir- 
ing a unitary elasticity of substitution in 
utility between the monopolistically compet- 
itive group and the perfectly competitive 
numeraire good. 

We believe that Yang and Heijdra’s mis- 
placed emphasis on closed-form solutions 


] 
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has led them astray. The desirable features 
they claim to have added can be attained in 
far better and more general ways in the 
formulation (1)-(3) above. To the extent 
that closed forms are needed for simple 
applications in macroeconomics and inter- 
national trade, for example, Yang and Heij- 
dra’s formulation adds a minor wrinkle at 
one point but incurs a large cost in reduced 
generality elsewhere. 


I. Justification for the Approximation 


Yang and Heijdra (1993) argue that since 
the equilibrium number n, of firms is en- 
dogenous, an assumption neglecting 1/n, 
needs to be justified in terms of the underly- 
ing parameters. This is true, but not at all 
hard to do. If one insists on a closed form, 
one must resort to the unitary-elasticity case 
used by Yang and Heijdra. There, n,. is 
large if and only if 


(5) 


where y is the share of the monopolistic 
group in expenditure, / is the economy’s 
labor endowment, p measures substitution 
within the monopolistic group, and a is the 
fixed cost of producing each variety in this 
group. However, without a closed-form so- 
lution, we can get a qualitative idea of which 
parameter combinations are conducive to a 
large n. by examining the total differential 
of the system (1)—(3) above. In rough terms, 
we want low fixed costs and imperfect sub- 
stitution between products in the group. 

We should also point out that Yang and 
Heijdra destroy the force of their own criti- 
cism when they assume that cross elastici- 
ties are negligible. Here they see no need to 
express the condition in terms of exogenous 
parameters: “in any application of this 
model one must ascertain ex post that ¢;; 
is indeed close to zero in equilibrium” (Yang 
and Heijdra, 1993 p. 296 [italics in the origi- 
nal]). 


II. Internal Consistency of the 
Approximation 


Having decided to treat 1/n as small, we 
do so with consistency throughout our pa- 
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per. Yang and Heijdra (1993) emphasize the 
terms of order | /n in the own elasticities ¢; 
but ignore them in the cross elasticities ¢;;. 
This is particularly strange in view of the 
fact that the error terms in treating e¢; > —o 
and ¢,,~0 are exactly the same, namely, 
—1+0@(q)|/n. It does not seem logically 
consistent to emphasize the error in one 
place and to ignore it in another in the 
same model. 

If Yang and Heijdra want to consider a 
group sufficiently small that 1/n is not neg- 
ligible, they should do so consistently. When 
e,, is significantly different from zero, it 
raises serious doubts concerning the validity 
of the price-Nash game stipulated in the 
model, and alternatives including tacit col- 
lusion and entry deterrence should be ex- 
plored. 


III. Passage to a Competitive Limit 


Yang and Heijdra (1993) argue that there 
is merit in letting elasticities increase with 
n. This is true in some applications, al- 
though in others (e.g., Helpman and Krug- 
man, 1985 Chapters 7-8) it makes no dif- 
ference to the qualitative results. In any 
case, a correction of order 1/n does not 
capture such dependence in a satisfactory 
way. In Yang and Heijdra’s equation [see 
(4), above], as n goes to infinity the own 
elasticity will rise in numerical value to o. If 
one wants a perfectly competitive limit of 
the model, as many would, the own and 
cross elasticities will have to rise to infinity, 
not to a finite limit. 


TV. Optimality of the Equilibrium 


While the limiting case of perfect compe- 
tition and full optimality may be of some 
interest, we found the equilibrium based on 
our approximation to be _ second-best 
(break-even constrained) optimal. This re- 
sult is sensitive to the specification, as we 
pointed out. What is really needed is a 
constant ratio of average and marginal utili- 
ties (or consumer surplus and revenue), and 
a constant elasticity of demand allows that. 
However, Yang and Heijdra’s (1993) modi- 
fication can only shift the equilibrium in a 
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small way when n is large; the numbers and 
prices will be close to those in our approxi- 
mation, and therefore their equilibrium will 
be close to being constrained optimal. Seri- 
ous objections to the optimality result must 
again be based on more substantial depar- 
tures from our specification. For a spatial- 
differentiation example in which the di- 
mension of the attribute space makes a 
difference, see Stiglitz (1986). 


V. Existence of Equilibrium 


Yang and Heijdra (1993) point out that if 
€ is constant in (1) and increasing returns 
take the power form, the system (1)—(3) is in 
general inconsistent. Write the cost function 
as C(x)=x* with B<1; then C’(x)= 
BC(x)/x, and dividing (1) by (2) gives 

1-1/e=B 

which is generally impossible when ¢ and B 
are separately and exogenously specified. In 
Dixit and Stiglitz (1977) the cost function 
had a fixed component with constant mar- 
ginal cost, so the problem did not arise. 
Yang and Heijdra propose to rescue mat- 
ters for the power cost function by letting « 
vary endogenously as in (4) above. However, 
for large n the variation of € is very slow, 
and the equilibrium shifts dramatically when 
some parameter changes by a small amount. 
The problem of such “fragile” equilibria is 
in practice not much better than nonexis- 
tence (see e.g., Lawrence Summers, 1991). 
Again we think that Yang and Heijdra’s aim 
is much better accomplished in the general 
setting of (1)—-(3) above. 


VI. Price Paid for Abandoning the 
Approximation 


To introduce a term of order 1/n Yang 
and Heijdra (1993) have restricted the model 
to the case of a unitary elasticity of substitu- 
tion between the monopolistic group and 
the numeraire good. We judge the trade-off 
to be quite unfavorable; the drastic reduc- 
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tion in parameter space of the model buys a 
correction that in practice is of minor sig- 
nificance. 

This case has been used in some applica- 
tions to make some broad thematic points 
about the nature of trade or the possibility 
of underemployment equilibrium, where the 
exact value of the elasticity was irrelevant. 
However, the presence or absence of 1/n 
terms in elasticities was equally irrelevant 
there. 

Finally, we should note that Yang and 
Heijdra are not the first to add the 1/n 
terms to our model, although they appear to 
have been the first to be published doing so. 
To record priority, we should mention an 
unpublished note by Kelvin Lancaster 
(1980); doubtless there are others unknown 
to us. 
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Should Governments Learn to Live with Inflation? Comment 


By Jonas AGELL AND BENGT-CHRISTER YSANDER* 


Economists have for a long time tried to 
convince suspicious policymakers of the 
merits of various indexation policies as a 
way of insulating the economy from the 
potentially pervasive effects of inflation. In 
an interesting recent paper, Stanley Fischer 
and Lawrence Summers (1989; hereafter FS) 
argue that the suspicion is warranted. Rely- 
ing on the dynamic consistency theories of 
inflation developed by Finn Kydland and 
Edward Prescott (1977) and Robert Barro 
and David Gordon (1983), they reach two 
primary conclusions: 


First, recognizing that inflation is a 
choice variable that will be affected by 
changes in nominal institutions sub- 
stantially undercuts the case for infla- 
tion mitigation measures by govern- 
ments that have not established a firm 
anti-inflationary reputation. It is easy 
to construct examples in which the 
costs of the extra inflation that results 
from inflation palliation outweigh the 
direct benefits of the lower cost of a 
given inflation rate. Second, foregoing 
indexation is to some extent a substi- 
tute for developing a reputation for 
pursuing anti-inflation policies. Na- 
tions with strong anti-inflationary rep- 
utations can more easily afford indexa- 
tion policies than nations without such 
reputations. [1989 p. 382] 


The generality of the claims of FS fits less 
well with the specificity of their theoretical 
model. First, the FS analysis is appropriate 
for economies in which wage-setting takes 


*Professor Bengt-Christer Ysander, of Uppsala 
University and the Royal Swedish Academy of Sci- 
ences, passed away on March 29, 1992. Jonas Agell is 
at the Institute for International Economic Studies, 
S-10691 Stockholm, Sweden. The authors appreciate 
the comments of Susanne Ackum, Anders Forslund, 
Bertil Holmlund, Torsten Persson, and in particular, 
an anonymous referee. 


place in a decentralized fashion and, as a 
corollary, wage-setters do not behave strate- 
gically. Yet they offer their model as an 
explanation for the fact that “...most in- 
dustrial countries do relatively little to miti- 
gate its [inflation’s] adverse effects” (FS, 
1989 p. 382). We show below that their 
conclusions need not apply to economies in 
which wage-setting is managed in a more 
centralized and strategic manner. Since a 
number of industrialized economies fall into 
this latter category or between the two ex- 
tremes of centralized and decentralized 
wage-setting, there is little ground for FS’s 
general presumption against inflation pallia- 
tion. 

Second, in the setup of FS the only way 
for unanticipated inflation to affect the 
economy is through a standard expecta- 
tions-augmented Phillips curve. Yet, proba- 
bly much more important effects are the 
intersectoral and intergenerational redistri- 
butions of income and wealth associated 
with unanticipated inflation. These redistri- 
bution effects provide a simple zero-sum 
rationalization of why economies with unin- 
dexed institutions may end up in a high- 
inflation equilibrium. 


I. Review of the Fischer-Summers Analysis 


Following the model of Barro and Gor- 
don (1983) of the credibility problem of 
low-inflation monetary policy, FS assume a 
short-run Phillips curve 


(1) U=U,—a(p- p*) 


where U is the unemployment rate, U, is 
the natural rate of unemployment, p is the 
log of the price level, and p* is the log of 
the expected price level. Adopting the con- 
vention that the initial price level is unity 
(i.e., Py =), the inflation rate simply be- 
comes p. 
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The government sets p to minimize the 
loss function 


(2) L + xbp* 


where the social efficiency costs of actual 
(as opposed to expected or unanticipated) 
inflation are measured in an Harberger-like 
manner by the quadratic term 4bp”.' In the 
one-shot game between a myopic govern- 
ment and the private sector considered by 
FS, equilibrium inflation (in the sense that 
p = p*) becomes 


(3) p=(a/b)U,. 


Equation (3) lies at the heart of the policy 
conclusions of FS. Equilibrium inflation in- 
creases with a and decreases with b. Inter- 
preting these results in terms of indexation, 
FS conclude that wage indexation (which 
reduces a and makes the Phillips curve 
steeper) lowers inflation and increases so- 
cial welfare. More provocatively, they also 
contend that most other forms of indexation 
increase inflation and decrease social wel- 
fare by decreasing b.” 


II. Indexation in a Model of Strategic 
Union—Government Interaction 


Let us start with the case of strategic 
wage-setting in an economy with unindexed 
institutions. Following John Driffill (1985), 
Henrik Horn and Torsten Persson (1988), 
and others, we consider a game between an 
all-encompassing monopoly union setting 
the rate of wage inflation and a government 
controlling the rate of general inflation. 

While the government’s objective func- 
tion (2) carries over to our model, describ- 
ing union behavior requires some additional 
notation and definitions. We assume that 


'To facilitate comparison with our model of strate- 
gic union—government interaction, we assume that the 
parameter k in equation (2) of FS, representing the 
government’s target rate of unemployment, is zero. 

“Evaluating (2) in a fulfilled-expectations equilib- 
rium, it is easy to see that the government’s loss 
function is decreasing in b. 
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the union cares about the real wage after 
taxes (Tor Hersoug, 1984). To capture the 
well-known effect of nonindexation of tax 
brackets in a progressive income-tax system, 
we consider a real disposable income func- 
tion of the form 


(4) 


where 0<7<1 and capital letters denote 
wage and price levels. The parameter r+ 
represents the degree of tax progression. 
The smaller is +, the more substantial is the 
bracket-creep induced by higher wage in- 
flation. 

Unemployment depends positively on the 
log of the real wage: 


(5) U=w-p. 


The objective of the union is to maximize 


(6) —4U*+a,(rw—- p)—84bp?. 


The first and second terms are of a standard 
form. As in Driffill (1985), the utility func- 
tion is quadratic in unemployment and lin- 
ear in the real wage after tax. We also allow 
the union to care about actual inflation; an 
all-encompassing union should account for 
the fact that its members ultimately bear 
some of the efficiency costs of inflation 
(Richard Jackman, 1988). These efficiency 
costs are reflected by the third term. The 
constant 6 (0 <6 <1) represents the share 
of the social efficiency costs of inflation that 
is internalized by the union. When 6=1, 
government and union give the same weight 
to the costs of actual inflation. 

Consider a single-stage Stackelberg game, 
in which the union (the leader) sets the 
wage before the government determines the 
price level. Apart from conforming with FS’s 
benchmark case of a myopic accommodat- 
ing government, this way of structuring the 
game also seems to parallel the institutional 
setup in many European countries (see e.g., 
Hersoug, 1985; Horn and Persson, 1988). 
Minimizing (2) subject to (5), treating w as 
parametrically given, gives us the govern- 


VOL. 83 NO. 1 


ment’s reaction function: 


Ww 


7 =—_., 
(7) 

We assume that the union knows the 
government’s reaction function. Maximizing 
(6) subject to (5) and (7) yields the optimal 
wage: 


(8) w*=(1+b)a,[7(1+ 5) -1] 
x[b(é+b)] 


which combined with (7) gives equilibrium 
inflation as 


(9) p*=a,[r(1+b)-1][b(5+5)] 


In the model of FS, equilibrium unem- 
ployment, U,, is taken as given and unaf- 
fected by changes in nominal institutions. In 
our model with wage-setting agents, equilib- 
rium unemployment is a choice variable 
conditioned on the institutions of the econ- 
omy. Combining (5), (8), and (9), we obtain 


(10) U*=a,[r(1+ 


In the verbiage of the literature, equa- 
tions (9) and (10) characterize the “discre- 
tionary,” or “time-consistent,”’ solution to 
our stabilization policy game.* While both 
the government and the union would prefer 
lower inflation, neither of them has an in- 
centive to deviate from the wage and price 
decisions given in (8) and (9). However, a 
cursory inspection reveals that the equilib- 
rium depends on the institutional environ- 
ment. By exercising different indexation op- 
tions, both the government and the union 
can alter the policy framework which both 
players take as given when setting prices 
and wages. 

The main conclusion of FS is that all 
measures that decrease the efficiency costs 


3In the following, we assume that the tax system is 
not “too” progressive, in the sense that 7 >1/(1+ b). 
As a consequence, p* and U* are always nonnegative. 
The intuition is discussed in what follows. 
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of actual inflation (like paying interest on 
nominal money holdings or reducing the 
deadweight losses induced by nominalistic 
capital taxation by indexing the tax base) 
are likely to be counterproductive. This re- 
sult need not hold in economies where 
wage-setters behave strategically. 

Differentiating (9) and (10) with respect 
to b, we find that 


dp* 
(11) =a,{(6+b)(1-7) 


—b[r(1+ b)—-1]}[b(6 + b)] 7 


au * 


Only in the special case when +=1 and 
5 = 0 do we obtain results analogous to those 
of FS. Indexation measures that reduce b 
then lead to higher inflation, while unem- 
ployment remains unchanged. Consider next 
the case when t=1 and when the union 
internalizes some of the excess burden of 
inflation (i.e., 6 > 0). From (11), we see that 
a lower value of b again leads to higher 
inflation. However, from (12), it now follows 
that unemployment decreases. As the union 
knows the government’s reaction function 
(7), it also realizes that a lower value of b 
implies a more inflationary policy rule. Con- 
sequently, a union that cares about inflation 
moderates its wage claims, which in turn 
implies lower real wages and unemploy- 
ment. An analysis of the net welfare effects 
of decreasing b must set the gains from 
lower unemployment against the costs from 
higher inflation.‘ 

Even when the union does not care about 
inflation, it is easy to find examples in which 
protection against the efficiency costs of in- 
flation is beneficial. When 7 <1, we note 
from (12) that a decrease in b is associated 
with a decrease in unemployment irrespec- 
tive of the value of 6. The bracket-creep 


*Using (2), we can show that the net outcome of 
decreasing b is higher welfare whenever 6 = 1. 


implied by a progressive and unindexed tax 
on earned income means that the union 
again moderates its wage claims when a 
lower value of b makes the government less 
prone to inflation-fighting. This mechanism 
also explains why dp* /db is an increasing 
function of +. Indeed, if 7 is sufficiently 
small, a lower value of b may reduce both 
unemployment and inflation. 

While the preceding argument casts doubt 
about FS’s general claims concerning the 
effects of mitigating the efficiency costs of 
inflation, it also suggests that nonindexed 
tax brackets might be a lesser evil than 
generally perceived. From (9) and (10), it is 
easy to see that we can reach a social opti- 
mum with both inflation and unemployment 
equal to zero by setting the degree of tax 
progression as 


(13) 


The inflation and unemployment character- 
izing the initial equilibrium is due to the 
fact that the union, unlike the government, 
wants to trade lower employment for higher 
real wages. The trade-off facing the union 
depends, however, on the degree of tax 
progression. When (13) holds, the union 
realizes that bracket-creep and inflation will 
undercut any increase in nominal wages. As 
the union still dislikes unemployment, the 
net outcome is a realignment of government 
and union objectives, which in turn implies 
an equilibrium with zero inflation and un- 
employment. As a corollary, it is easy to 
show that indexation of tax brackets is un- 
warranted. By removing one of the costs 
that wage inflation imposes on the union, 
indexation of tax brackets produces more 
inflation and unemployment.° 


°To show this, replace equation (4) by the dispos- 
able income function Y=(W/P)’ and derive the cor- 
responding expressions for inflation and unemploy- 
ment. 
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Some Alternative Games 


Game-theoretical policy analysis seldom 
generates robust insights. Our union- 
government game is no exception. Aban- 
doning the notion of a strong union and a 
weak government, our results will change. 
In a Nash equilibrium, the government and 
the union simultaneously set prices (govern- 
ment) and wages (union), while taking each 
other’s optimal reactions as given.° The 
union’s reaction function, obtained by maxi- 
mizing (6) subject to (5), holding p con- 
stant, is given by 


(14) 


W= p+ 


Combining (14) and (7), equilibrium infla- 
tion and unemployment become 


(15) p* =a,t/b 


(16) 


U* =ayr. 


The Nash equilibrium of our model is 
closely related to the discretionary equilib- 
rium of the model of FS. In both cases, 
private agents act nonstrategically vis-a-vis 
the government. We may thus expect results 
in line with those of FS. This is indeed the 
case. Equations (15) and (16) imply that 
indexations that decrease the efficiency costs 
of inflation increase equilibrium inflation 
while leaving unemployment unchanged. As 
the union treats inflation as beyond its in- 
fluence, it has no incentive to moderate 
wages in response to a smaller value of b. 
For the same reason, it is also easy to show 
that inflation protection of tax brackets has 
no effects on inflation and unemployment. 

Consider next a Stackelberg game in 
which the government is the leader. From 


°We note in passing that this setup can be inter- 
preted as a game between an accommodating govern- 
ment and an aggregate of several unions, where the 
individual union is small enough to ignore the govern- 
ment’s reaction to its wage decision (see e.g., Andrew 
Oswald, 1979). 
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the union’s reaction function (14), it is clear 
that equilibrium unemployment is the same 
as unemployment in the Nash solution. 
Equilibrium inflation, on the other hand, is 
zero, the reason being that the government 
now uses its influence on union wage forma- 
tion to induce the union to obtain its real 
wage target at a lower nominal wage than in 
the Nash equilibrium. 

That a strong government may produce a 
better outcome than a weak accommodating 
one is hardly surprising. This result is, how- 
ever, not robust. If the government cares 
about its tax revenue, the nonindexation of 
tax brackets provides an incentive for gener- 
ating extra inflation. From the union’s reac- 
tion function, the government knows that 
higher inflation increases wages proportion- 
ally and that the resulting bracket-creep in- 
creases real tax revenue whenever 7 <1. In 
contrast to the case of a strong union and 
weak accommodating government discussed 
in the preceding section, inflation protec- 
tion of tax brackets now leads to lower 
inflation. 


IV. A Zero-Sum Model of Inflation 


In their systematic account of the real 
effects of inflation, Fischer and Franco 
Modigliani (1978 p. 827) conclude that 
“..the wealth redistributions arising from 
unanticipated inflation are large, of the or- 
der of 1 percent of GNP per 1 percent 
unanticipated increase in the price level.” 
While it is difficult to be more precise about 
the direction of these redistributions, two 
broad tendencies stand out. First, in an 
unindexed economy, unexpected inflation is 
likely to imply redistributions from the old 
to the young, partly because the ratio of net 
nominal to real assets typically rises with 
age and partly because inflation erodes the 
value of unindexed social security (Fischer 
and Modigliani, 1978 pp. 823-6). Second, in 
countries where the government is a net 
debtor, it benefits from surprise inflation. 

These wealth transfers may amount to 
much more than mere bookkeeping. Con- 
sider the effects of adding a retired genera- 
tion to the model of union—government in- 
teraction of Section II. This third player 
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holds nominally denominated government 
debt and obtains unindexed social security 
but has no influence on the price and wage 
decisions of government and union. Both 
government and union then gain from 
unanticipated inflation: the government by 
getting its debt amortized, union members 
by an induced transfer from the retired gen- 
eration. Formally, the government mini- 
mizes the loss function 


(2') + 3bp* — y,( p— p*) 
while the union maximizes 
(6’) p) 


The last term in each expression repre- 
sents the distributional gain from unex- 
pected inflation.’ While p is the actual price 
level, p* is the price level expected by the 
retired generation. We assume that both 
union and government treat p° as exoge- 
nously given (in a fulfilled-expectations 
equilibrium, p* equals the equilibrium in- 
flation rate). The parameters y, and y, 
(y,,Y¥, > 9) represent the strengths of the 
union’s and the government’s incentives to 
create unexpected inflation. 

Consider again the Stackelberg game of 
Section II, in which the union (leader) sets 
the wage before the government (follower) 
determines the price level. Repeating the 
steps followed above, we obtain 


(Y) p*={by,+ y+ a,[7(1+ b)-1]} 
(10) U*={y,—y,5 + a,[7(1+ 5) —1]} 


x(5+b)7'. 


‘Allowing for quadratic terms in unexpected infla- 
tion would complicate the algebra, without affecting 
the essence of our results. 


Differentiating, we find that 


(17a) 
(17b) <0 if 6>0 
(17c) —>0 
aU* 
(17d) —>0 
OY, 


The intuition is not hard to find. Increasing 
the gains from unexpected inflation in- 
creases equilibrium inflation. In the case of 
the union, the incentive to create inflation 
operates in a roundabout way. When y,, 
increases, the union sets a higher nominal 
wage, knowing that the accommodating gov- 
ernment thereby is induced to create extra 
(unexpected) inflation. This mechanism also 
explains why an increase in y, leads to 
higher unemployment. As the government 
only partially accommodates the wage push, 
real wages and unemployment increase. Fi- 
nally, we note that an increase in y, reduces 
unemployment, at least as long as the union 
internalizes some of the deadweight burden 
of additional inflation. When the govern- 
ment is given a marginal incentive to create 
surprise inflation, a union that cares about 
inflation will moderate its wage claims, 
which in turn implies lower unemployment 
and real wages. 

From the above, it is clear that reducing 
the scope of intergenerational redistribu- 
tions from the retired generation to the 
collective of younger trade-union members 
unambiguously increases social welfare. By 
indexing social security and private loan 
contracts, the value of y, declines, which 
lowers inflation and unemployment. Also, 
with a social loss function quadratic in U 
and p, it is easy to show that measures that 
reduce y, (e.g., indexation of government 
debt instruments) increase welfare, since the 
gain in the form of lower inflation then 
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always dominates the cost in terms of higher 
unemployment. 

The policy lesson is straightforward. For 
every loser from unanticipated inflation 
there is a gainer. If the gainers also happen 
to be key players in the inflationary process, 
the outcome is high inflation. This zero-sum 
rationalization of inflation also goes some 
way toward explaining the common practice 
of indexing social-security benefits.* How- 
ever, it also raises the question of why other 
forms of protection against unexpected 
wealth redistributions are so rarely seen. In 
particular, by eliminating the other main 
channel for wealth redistributions from un- 
expected inflation, widespread indexation of 
private and government loan contracts may 
reduce inflation substantially. 


V. Conclusions 


There is no general time-consistency ar- 
gumeut against indexation policies. In 
economies where wage-setters behave 
strategically, governments may have much 
to gain from better inflation protection. If 
we accept the view that much of the driving 
force in the inflationary process comes from 
the zero-sum redistributions from unex- 
pected inflation, we also have a powerful 
argument for indexations of social security 
and private and government loan contracts. 
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Should Governments Learn To Live with Inflation? Reply 


By STANLEY FISCHER AND LAWRENCE SUMMERS * 


Until recently, it was common for 
economists to urge the widespread adoption 
of indexation, and for policymakers in low- 
inflation economies to reject that advice as 
a counsel of despair, likely only to increase 
inflation and reduce economic welfare. The 
main contribution of our 1989 paper 
“Should Governments Learn to Live with 
Inflation?” was to provide a coherent model 
that explains the policymakers’ viewpoint. 
As we stated, “It is easy to construct exam- 
ples in which the costs of the extra inflation 
that results from inflation palliation out- 
weigh the direct benefits of the lower cost 
of a given inflation rate” (p. 382). 

While we were pleased to have found an 
explanation for a hitherto puzzling conflict 
of views between economists and_ policy- 
makers, no reader of our article should have 
come away from it with the view that we 
either argued or believe that inflation palli- 
ation always reduces economic welfare. We 
presented examples where that is not the 
case and provided as thorough a discussion 
as space permitted of why we believe that 
indexation is useful in  high-inflation 
economies. 

Jonas Agell and Bengt-Christer Ysander 
(1993) appear to read our paper incorrectly 
as arguing that inflation palliation always or 
almost always reduces welfare. They pre- 
sent two models in which inflation palliation 
improves welfare, as it does in some of the 
models in our paper. What should the 
reader make of their models and results? 
The first model is one in which wages are 


*Fischer: Department of Economics, Massachusetts 
Institute of Technology, Cambridge, MA 02139, and 
NBER; Summers: The World Bank, 1818 H Street, 
N.W., Washington, DC 20433, Harvard University, and 
NBER. We are grateful to Lant Pritchett for assis- 
tance. 
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set strategically by a union that faces a 
nonindexed tax system and that also cares 
about inflation. This is a standard model of 
strategic wage-setting, which has been used 
by John Driffill (1985) and Henrik Horn and 
Torsten Persson (1988) among others. It 
remains relevant to some European coun- 
tries. 

In this model, the impact of inflation mit- 
igation on welfare depends on parameter 
values, as it does in our 1989 paper when 
inflation is uncertain. In Agell and Ysander’s 
model, by setting the progressiveness of 
wage taxation appropriately, it is even possi- 
ble to reach the social optimum in their 
nonprecommitment wage- and price-setting 
game. In this situation, the union, which is 
penalized for inflation by the nonindexation 
of income tax, internalizes some of the so- 
cial costs of inflation. 

Agell and Ysander’s model produces a 
result that is very much in the spirit of the 
basic result in our paper, for as they show, it 
is optimal not to index wage taxation when 
income tax progression is set so that the 
first-best outcome is attained. This is an- 
other example of a model in which the 
introduction of indexation may reduce so- 
cial welfare—though to repeat, there are 
parameter values for which other inflation- 
mitigation measures improve social welfare. 

Agell and Ysander appear to place more 
weight on the final model presented in their 
paper, in which unanticipated inflation re- 
distributes wealth and is taken into account 
by the government and the wage-setting 
union. When the price level is higher than 
expected, both the government and the 
workers benefit. They gain at the expense of 
retirees who are not explicitly included in 
the model. The result here is that inflation 
mitigation reduces the inflation rate; that is, 
when the government and the union each 
benefit less from higher inflation, the infla- 
tion rate declines. 
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While this result is correct in the context 
of the model, we find the objective func- 
tions used in the model extremely unper- 
suasive. U.S. politicians would be aston- 
ished to learn that retirees have no political 
voice or impact on government preferences. 
We are also surprised that unions, which 
have extensive pension funds that are dam- 
aged by unexpected inflation whether in- 
vested in either bonds or stocks, are treated 
as welcoming unanticipated inflation. 

Agell and Ysander describe unexpected 
inflation as producing zero-sum redistribu- 
tions in their model. In fact, unexpected 
inflation benefits all participants whose de- 
cisions are registered in their model. We 
would have expected a loss function for the 
government that penalizes unanticipated re- 
distributions in general, rather than reward- 
ing redistributions away from the retired. 
Such an effect could be represented by in- 
cluding a quadratic term in unanticipated 
inflation in the objective function of the 
government. The inclusion of such a term 
does not change the conclusions of non- 
stochastic versions of the models used in 
either our 1989 paper or Agell and 
Ysander’s comment. 

Furthermore, changes in the cost of 
unanticipated inflation do not affect the ex- 
pected inflation rate in a stochastic version 
of our basic model, as discussed at the be- 
ginning of section II of our 1989 paper. 
Since the expected inflation rate does not 
change in that model when the costs of 
unanticipated inflation decline (correspond- 
ing to increased indexation) and since policy 
can do nothing in that model to reduce the 
variance of the inflation rate, inflation miti- 
gation would increase economic welfare. 
This is hardly surprising, since there is no 
feedback between the mitigation of the costs 
of inflation and the government’s choice of 
the inflation rate. 

We find nothing in Agell and Ysander’s 
paper that would cause us to revise the 
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general conclusions presented in the last 
two paragraphs of our 1989 paper (p. 387): 


This line of argument suggests that 
certain forms of inflation mitigation 
such as measures that protect Social 
Security Benefits from being eroded 
by increased inflation, or that prevent 
inflation from causing sharp increases 
in business tax burdens, may signifi- 
cantly increase equilibrium inflation 
rates and reduce welfare by eliminat- 
ing important sources of political op- 
position to inflation. On the other 
hand, measures like the use of infla- 
tion accounting or the issuance of in- 
dexed bonds, that do not undercut im- 
portant anti-inflation lobbies may have 
less-pronounced inflation-enhancing 
effects. 

... The general principle seems to 
be that in the presence of distortions, 
policies that would represent Pareto 
improvements if behavior were un- 
changed by their implementation, may 
be undesirable once their incentive 
effects are recognized. 
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ERRATUM 


Government Size and Economic Growth: A New Framework 
and Some Evidence from Cross-Section 
and Time-Series Data: Reply 


By Rati RamM* 


Please read Y in place of G on the left-hand side of each of the four equations for which 
regression estimates are reported on pp. 282-3 of my “Reply” in the March 1989 issue of 
this Review (Volume 79, No. 1, pp. 281-4). 


*Department of Economics, Illinois State University, Normal, IL 61761-6901. 
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Unification of Germany 


Gerlinde Sinn and 
Hans-Werner Sinn 
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random effects, unbalanced), com- LOGIT, GMM, minimum distance, 
plete regression diagnostics sample selection models, general 

maximum likelinood 
Programming Features: Symbolic 
equation substitution and differ- Time Series Estimation: ARIMA, 
entiation, matrix algebra equations, GARCH-M, PDLs, Shiller lags, VAR, 
loops over numbers or names, Kalman filter, AR1 and AR(p) 
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The Macroeconomic Picture 
of the '90s 


Trade and Development 
Report, 1992 


Industry and Development, 
1992/93 


“The world economy has been suffering its most 
severe recession since the Second World War. No 
single country can solve the macroeconomic prob- 
lem on its own: the situation demands improved 
coordination.” 


This annual report provides a thorough review of the 
world economic situation and reviews the significant 
trends and developments in international trade. It 
examines in:- 


Part I - 

The Current situation and Prospects - analysing 
the recent developments and short-term prospects of 
international markets; regional performance and short- 
term prospects of developing countries, China, Cen- 
tral and Eastern Europe and the former USSR; 
developed market-economy countries; and gives a 
long-term perspective on development and growth. 


Part II - 

Finance and Economic Expansion - examining 
recent trends, financial positions, renegotiation and 
reduction of bank debt, costs and other terms of 
private export credits and official debt reschedulings 
of the international capital markets and external debt; 
and the undercurrent of debt deflation, showing cur- 
rent conditions in financial markets of the United 
States and other countries, sources of current difficul- 
ties and the likely consequences for recovery. 


Part III - 

Policy Reform in Developing Countries - examin- 
ing the issues and the evidence from the 1970s and 
1980s, inward and outward orientation in develop- 
ment and the phasing of reform; and discusses the 
characteristics, performance and factors affecting 
performance in reforming public enterprises. It offers 
insight into areas of reform and draws some conclu- 
sions. 

E.92.11D.7 92-1-112321-6 $42.00 


“The importance of privatization as a favoured 
policy tool cannot be denied.” 


This year’s Global Report has thus taken up ‘Privati- 
zation’ as its special theme in an endeavour to help 
policy makers contemplate privatization for the first 
time as well as to reassure those who have embarked 
on that course and might be encountering problems. 
A chapter is dedicated to reviewing the recent priva- 
tization experiments in some 50 countries. The 
analysis is given an added dimension by answers 
from aquestionnaire survey of managers of industrial 
enterprises facing the challenge of privatization. 


It also offers a wealth of information in its analysis of: 

® the industrial performance of major economies 
around the world 

® eight Asian economies which demonstrate the sym- 
biotic relationships between manufacturing and 
agriculture and between manufacturing and ser- 
vices 

© twelve selected industrial sectors based on contri- 
butions from editors of professional trade journals 


Researchers and policy makers will find the statistical 


annex a valuable source of information. 
E.92.1ILE.4 92-1-106276-4 $55.00 548pp 


Send orders to: 


United Nations Publications, Sales Section, 
Room DC2-0853, Dept. 016A, 

New York, N.Y. 10017, Tel. (800) 253-9646, 
(212) 963-8302, Fax. (212) 963-3489. 


Visa, MasterCard and American Express 
accepted for orders over $15.00. 
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HEALTH CARE AND PENSION 
STUDIES FROM THE OECD 


U.S. Health Care at the Cross-Roads. 
(Health Policy Studies No. 1). 


U.S. Health Care costs are almost twice the average of other industrialized nations on a 
per capital basis. As health care moves to the top of the policy agenda in the United States, 
two issues—high and rising costs and inadequate access to care—have defined the policy 
debate. This report, drawn from the OECD Economic Survey of the United States 1992, 
analyzes health-care problems in the United States from an international perspective. The 
report concludes that real policy options for containing cost and increasing access are much 
narrower than they appear, and should be of interest to health care professionals, insurers, 
and anyone concerned with the direction of health care in the United States. 


11-92-03-2, November 1992, 99 pages, ISBN 92-64-13780-7, $15.00 


Private Pensions and Public Policy. 


Governments have begun to look to the private sector for ways in which retirement income 
might be provided without raising taxes in the years ahead, and which might also reduce 
pressures on the public budget in the short term. Often lacking, however, is a comprehen- 
sive view of all the roles played by private pensions, how well those roles are performed, 
what conflicts may be engendered between public and private goals, what potential private 
pensions may hold—and at what cost. This volume is a contribution to informed debate 


on the issue. 81-92-06-1, December 1992, 160 pages, ISBN 92-64-13790-4, $35.00 


Private Pensions in OECD Countries: the United States. 


Interest in changing the balance between public and private provision of retirement 
income, with increased reliance on private sources, is emerging in OECD countries. Al- 
though documentation and analysis of public systems are abundant, much less is known 
about the performance of private plans. This volume, one of a series of national case 
studies, describes the characteristics of private arrangements in the United States. It dis- 
cusses their prevalence, structure, financing and administration, tax treatment, regula- 
tion, and their effects on the distribution of retirement income. 


81-93-01-1, February 93, 100 pages, ISBN 92-64-13802-1, $22.00 


To order, send your check or money order to: 
OECD Publications and Information Center 
2001 L Street, NW, Washington, DC 20036 
Telephone: (202) 785-6323 
Book Orders: 800-456-OECD 
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NEW FROM 


THE FREE PRESS 


ADAM SMITH 
IN HIS TIME 


ANDOURS 


\ 


JERR 


YZ. MULLER 


ECONOMIC ADAM SMITH IN HIS 
PRINCIPALS TIME AND OURS 


The Masters and Mavericks Designing the Decent Society 
of Modern Economics Jerry Z. Muller, Catholic University 
David Warsh ¢Brilliant and compelling... Particularly 
€¢David Warsh’s column in the Boston Globe illuminating is its pervasive theme on the 

is unique: the only regular, thoughtful, and importance of unanticipated consequences 
frequenily irreverent discussion of economics _ in the workings and development of society 
and economists in any American newspaper. and economy. By enlarging upon its text and 


A large number of readers will be delighted to subtext, Muller has placed The Wealth of 
have available this wonderful and most timely _ Nations in its own time while enlarging upon 


collection of essays.°® its civil relevance for our time.9® 
—Henry Rosovsky, Harvard University —Robert K. Merton, Columbia University 
¢Over the years no journalist has kept up so 1992 0-02-922234-6 $22.95 


well and so accurately with the development of 


economic thought as David Warsh. His precise "JHE SEVEN FAT YEARS 


and interesting sketches have stood the testof _ and low to Do It Again 
years very well.9® 


—Kenneth J. Arrow, Stanford University Robert L. Bartley, Editor, 

6¢David Warsh has two qualities rare in econom- The Wall Street Journal 

ic journalism: he actually likes economics, and  ¢¢Bartley takes ‘supply side’ theory from the 

he knows a great deal about it. The columns realm of mystical gospel for a few to a thoughtful 
contained in Economic Principals are consistently _ exposition for the rest of us. In doing so, he 
well-informed, lively, and interesting. Together, _ has not only written a book that deserves to 


they provide a marvelous introduction to the be read, but one that will influence thinking 
current scene in economic thinking.°? about public 
—Robert E. Lucas, Jr., University of Chicago | —Paul Volcker 


1993 0-02-933996-0 $22.95 1992 0-02-901915-X $22.95 
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THE CATHOLIC ETHIC 
AND THE SPIRIT 
OF CAPITALISM 


Michael Novak, 
American Enterprise Institute 
*¢An original and exciting exploration of the 
moral structure of the free market economy, an 
excellent introduction to the catholic ethic (with 
a small ‘c’), and a social and economic manifesto 
for a new approach to overcoming problems of 
poverty, ethnicity, and race relations.°® 
—Jack Kemp 

1993 0-02-923235-X $22.95 


GENERATIONAL 

Who Pays—and 
a or What We Spend 
Laurence J. Kotlikoff, Boston University 


¢Kotlikoff has something important to say. 
He makes an urgent case for examining just 
who pays for what we spend.** 
—The Wall Street Journal 

1992 0-02-918463-0 $22.95 


THE WINNER’S CURSE 
Paradoxes and Anomalies 

of Economic Life 

Richard H. Thaler, Cornell University 


6¢Every now and then an economist writes a 
book that is both interesting and entertaining. 
Thaler has done just that.® 
—The Wall Street Journal 

1992 0-02-932465-3 $24.95 


THE COMPETITIVE 
ADVANTAGE OF 
NATIONS 

Michael E. Porter, 

Harvard Business School 


6¢An all-embracing view of economic change 
that amounts in the end to a powerful analytic 
framework...A real achievement.®® 
—The Economist 

1990 0-02-925361-6 $39.95 


Now in paperback— 


CAPITAL IDEAS 

The Improbable Origins of 
Modern Wall Street 

Peter L. Bernstein 

6¢An engaging history of the rise of investment 
theory, from French mathematician Louis 
Bachelier’s dissertation, ‘The Theory of Speculation, 
to the formulation of portfolio insurance.°* 
—USA Today 


1993 0-02-903012-9 $12.95 paper 


Named tivo of the five 
most cited books 
in the field of economics— 


THE ECONOMIC 
INSTITUTIONS OF 
CAPITALISM 
Firms, Markets, 
Relational Contracting 
Oliver E. Williamson, 
Yale University 

1987 0-02-934821-8 $17.95 paper 


MARKETS AND 
HIERARCHIES 
Analysis and Antitrust 
Implications 
Oliver E. Williamson, 
Yale University 

1983 0-02-934780-7 $14.95 paper 


For Visa, MasterCard, Discover, 
or American Express orders, 
call toll-free 1-800-323-7445 


Monday through Friday 
between 9am-9pm Eastern time. 


(Fp) THE FREE PRESS 


A Division of Macmillan, Inc./A Maxwell Macmillan Company 


866 Third Avenue, New York, NY 10022 
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Announcing A New Journal..... 


ECONOMIC DESIGN 


Editor-in-Chief: 


Murat Sertel, Bogazigi University, Turkey 


Co-Editor: 


William Thomson, University of Rochester, USA 


Honorary Editor: 


Leonid Hurwicz, University of Minnesota, USA 


Associate Editors 


A Board of approximately 
25 Associate Editors will be 
appointed. At this moment 
the Board consists of: 
Jeffrey Banks, University 
of Rochester, USA 
Salvador Barbera, 
Universidad Auténoma de 
Barcelona, Spain 

Bhaskar Dutta, Indian 
Statistical Institute, India 
Faruk Gil, Stanford 
University, USA 

Matthew Jackson, 
Northwestern University, 
USA 

Richard Kihlstrom, 
University of Pennsylvania, 
USA 

Paul R. Kleindorfer, 
University of Pennsylvania, 
USA 

Semih Koray, Middle East 
Technical University, Turkey 
Eric S. Maskin, Harvard 
University, USA 

Thomas R. Palfrey, 
California Institute of 
Technology, USA 

Stanley Reiter, 
Northwestern University, 
USA 

John E. Roemer, 
University of California, 
Davis, USA 

Ariel Rubinstein, 
University of Tel Aviv, Israél 
David Schmeidler, 
University of Tel Aviv, Israél 
Sanjay Srivastava, 
Carnegie Mellon University, 
USA 


Joseph E. Stiglitz, 
Stanford University, USA 
Kotaro Suzumura, 
Hitotsubashi University, 
Japan 


Aims and Scope 


In this Age of Economic 
Design, the accumulated 
traditions and wealth of 
knowledge in normative 
and positive economics 
and the strategic analysis 
of Game Theory are 
applied with novel ideas in 
the creative tasks of 
designing and assembling 
diverse legal-economic 
instruments. These include 
constitutions and other 
assignments of rights, 
mechanisms for allocation 
or regulation, tax and 
incentive schemes, 
contract forms, voting and 
other choice aggregation 
procedures, markets, 
auctions, organizational 
forms, such as 
partnerships, together with 
supporting membership 
and other property rights, 


NORTH-HOLLAND 


(An Imprint of Elsevier 
Science Publishers) 


and information systems. 
These designs, the 
methods of analysis used in 
their scrutiny, as well as the 
mathematical techniques 
and empirical knowledge 
they employ, along with 
comparative assessments 
of the performance of 
known economic systems 
and implemented designs, 
all of these form natural 
components of the subject 
matter of Economic 
Design. 


Call for Papers 


If you feel that your 
manuscript will fit the scope 
of the journal Economic 
Design, please send four 
copies to the editor: 
Professor M. Sertel 
Department of Economics 
Bogazici University 

Bebek, Istanbul, Turkey. 


Publication Schedule 


The first issue of 
Economic Design is 
scheduled for publication at 
the beginning of 1994. One 
volume of 4 issues will be 
published in 1994. 


ELSEVIER SCIENCE PUBLISHERS 
Attn: Heleen A.C. Gierveld 

P.O. Box 1991 

1000 BZ Amsterdam, The Netherlands 


in the USA/Canada: 


Attn: Judy Weislogel 
P.O. Box 945, Madison Square Station 
New York, NY 10160-0757 


419NNECONDES1 
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INTERNATIONAL 
ECONOMIC JOURNAL 


VOLUME 6 WINTER 1992 NUMBER 4 


The Role of International Trade in the Convergence of Per Capita GDP 

in the OECD: 1950-85 Farhad Rassekh 
Effect of Risk Aversion on the Incentive to Share Information H. S. Hwang and N. S. Lee 
International Migration and Welfare in the Presence of Non-Traded Goods: 

An Interconnected Markets Approach Bharati Basu 
Intra-Industry Trade: The Australian Experience R. Ratnayake and P. Athukorala 
Factor Price Shocks, Factor Substitution and Their Implication for Policy Z. Mahmood 
Protection, Terms of Technology Transfer and Foreign Investment: 

A Welfare Analysis M. Anam and A. B. Supapol 
The Cost of Capital and the Multi-Nationalization of Japanese Banking John Thornton 
Changes in the Extent of Financial Interdependence between the G7 

Countries in the 1970s and 1980s Mark J. Holmes and Eric J. Pentecost 


MANUSCRIPTS SUBMITTED (in triplicate) in Asia for publication should be sent to the Managing 
Editor, Professor Wontack Hong, Department of International Economics, College of Social Sciences, 
Seoul University, Seoul 151-742, Korea. Manuscripts from countries outside Asia should be sent to the 
Co-Editor, Professor Young Chin Kim, Department of Economics, Northern Illinois University, DeKalb, 
Illinois 60115, U.S.A. Personal subscription price of a volume (which includes postage) is $40 per year. 
Institutional subscription price is $50 per year. Cheques should be made payable to International 
Economic Journal and sent directly to Managing Editor. (Volumes 1~5 are available.) 


THE Review OF Futures MARKETS — Now AVAILABLE 


Volume 10, Number 2 contains seven research studies on topics of current importance. 


The Review of Futures Markets is a fully refereed, professional journal on futures and options. 
The cost of a subscription is $35 for all 3 issues, or $15 for a single issue. To order, complete the 
form below. Include your Visa or MasterCard number, or a check or money order made payable 
to the Chicago Board of Trade, and mail to: Chicago Board of Trade, Publications Department, 
141 West Jackson Blvd., Suite 2210, Chicago, IL 60604-2994 


Visa/MasterCard orders can also be made by phone; call 1-800-THE-CBOT, ext. 31 or 312-435-3558, ext. 31. 


ORDER FORM (check one) Ship To: (Please type or print clearly) 
US. only Total 

[1 Volume 10 (3 issues) $35.00 
CO) Volume 10, Number 2 $15.00 


Name 


Company 


Orders shipping outside the U.S. 
(Payment should be drawn on a U.S. bank ($)) Address 


CJ Volume 10 (Airmail) $55.00 
1) Volume 10 (Surface) $42.50 
CL) Volume 10, #2 (Airmail) $27.00 
LC) Volume 10, #2 (Surface) $20.00 


Country 
Visa/MC# 
( 
Exp. Date: / Telephone Number 
(Month/Year) 


Cardholder Name: 


Address 


City, State, Zip 
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NEW FROM 


GREENWOOD 
GPG PUBLISHING GROUPixe 


Economic Activity, Trade, ANZUS Economics 

and Industry in the Economic Trends and Relations 
U.S.—Japan-World Economy among Australia, New Zealand, 
A Macro Model Study of and the United States 

Economic Interactions Edited by Richard W. Baker and 

By F. Gerard Adams, Byron Gangnes, Gary R. Hawke 

and Shuntaro Shishido 


Examines the changing economic nature of 

A pioneering effort to link quantitatively the ANZUS Alliance. 


Praeger Publishers. 1993. 0-275-94488-3. $46.00E. 


Multinational Culture 


Social Impacts of a Global Economy 
Edited by Cheryl R. Lehman and 


The Age of Giant Corporations 
A Microeconomic History of 


American Business, 1914-1992, Russell M. Moore 

A Third Edition The result of a conference on multinational 
By Robert Sobel culture, this collection explores the sociocul- 
A popular and valuable text, The Age of tural impacts of a global economy. Exam- 
Giant Corporations is now available in a ined are: government policies & their 
third edition, bringing the history up to relationship to multinational activities; the 
1992 and including a new chapter on the era formation & regulation of international cap- 
of junk bonds and its aftermath. ital; labor market segmentation & protec- 
Greenwood Press. 1993. 328 pages. tionism; managing multinationals without 
0-313-28730-9. $49.95. sacrificing ethical standards or profits; envi- 
A paperback is also available: ronmental impacts; & other aspects of a 
0-275-94470-0. $18.95 global economy. 


Greenwood Press. 1992. 360 pages. 


Corporate Environmentalism in 0-313-27822-9. $55.00. 


a Global Economy 


Societal Values in International America and the Multinational 

y Malina Szejnwa rown, Patric 
Derr, Ortwin Renn, and Allen L. White pot a 
with a chapter by Jeanne X. Kasperson Partnership 
and Roger E. Kasperson By John J. Reardon 
Analyzes values and perceptions relevant to Tells the story of the postwar relationship 
multinational corporations’ hazardous facil- between America’s multinational corpora- 
ity siting in developing countries. tions and the U.S. government. 
Quorum Books. 1993. 264 pages. Praeger Publishers. 1992. 194 pages. 
0-89930-802-3. $49.95. 0-275-93918-9. $45.00. 


ORDER TOLL-FREE, 24 HOURS-A-DAY: 1-800-225-5800 
Please call us for a free catalog at (203) 226-3571 


GREENWOOD PRESS - PRAEGER PUBLISHERS - QUORUM BOOKS 


88 Post Road West, Box 5007, Westport, CT 06881-5007 
AD 93 (203) 226-3571 » FAX (203) 222-1502 
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Journal of Forensic Economics 


A NEW JOURNAL IN A NEW FIELD 


A tri-annual, refereed academic journal which publishes both theoretical and 
applied papers concerning research and techniques in forensic economics. Articles 
encompass topics of interest to economists, attorneys and vocational experts 
engaged in such fields as personal injury/wrongful death litigation, business 
valuation, employment discrimination, commercial litigation, and anti-trust 
litigation. Once each year bibliographies and a listing of members are published. 


The JFE is published by the NATIONAL ASSOCIATION OF FORENSIC ECONOMICS, a non-profit 
association of economists, financial analysts, rehabilitation specialists, and attorneys in all 50 states 
as well as Canada, England, Germany, Hong Cong, Ireland, Japan, Puerto Rico, South Africa and the 
Virgin Islands. NAFE is not an accrediting association, but is a professional organization designed to 
foster scholarly debate in issues of forensic economics. NAFE membership includes a subscription to 
the JFE, participation in regional and national NAFE programs, a membership directory, quarterly 
newsletters informing our members of new research in forensic economics, new applications of 
economics in litigation, new papers being presented, upcoming meetings and calls for papers. 


MEMBERSHIP AND ADDITIONAL INFORMATION: 


National Association of Forensic Economics 
Post Office Box 30067 
Kansas City, MO 64112 
(816) 235-2833 FAX: (816) 235-5263 


Advanced Placement 
Dedicated to Educational Excellence 
for Nearly Four Decades 


Faculty Consultants for the AP Reading 


Each year, the College Board’s Advanced Placement (AP™) Program gives hundreds of thousands 
of exceptionally able high school students an opportunity to take rigorous college-level courses and 
appropriate exams in 16 disciplines. More than 2,900 colleges and universities offer credit or 
advanced standing to students based on their exam performance. For six days in June, more than 
2,400 college faculty and AP teachers from across North America gather at Trenton State College, 
Clemson University and Trinity University at the annual AP Reading to evaluate and score students’ 
essays. The participants also exchange ideas and contribute suggestions about their discipline, their 
courses, and the AP Examinations. Participants are paid honoraria, provided with housing and meals, 
and reimbursed for travel expenses. 

Applications are now being accepted for Faculty Consultants to the College Board’s Advanced 
Placement Readings in Economics. Applicants should currently be teaching or directing instruction 
for the first-year college course in either Microeconomics or Macroeconomics. 

For an application or additional information, please contact: Dr. Walt MacDonald, Director, 
Advanced Placement Program, Educational Testing Service, Rosedale Rd. 85-D, Dept. 93-19, 
Princeton, NJ, 08541. Educational Testing Service is an Equal Opportunity/Affirmative Action 
Employer and especially encourages minorities and women to apply. 
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Activate Your Students’ Interest! 


Principles of Economics 
James R. Kearl, Brigham Young University 


These texts are accompanied by unique problem-solving workbooks by Mark Rush! Not the 
standard books of fill-in-the-blanks, these singular workbooks offer extensive problem sets 
developed directly from recent data and articles in The Wall Street Journal and the New York 
Times. Kearl and Rush's "learning-by-doing" approach treats economics as a tool to be used, 


not just memorized. 


Features of all three texts 

e Each chapter ends with 
fully solved sample prob- 
lems (both qualitative and 
quantitative). 

¢ Incentives and behavior is 
treated explicitly as a core 
tool in Chapters 1 and 2, 
then developed throughout. 

¢ Singular international 
coverage begins with trade 
and exchange rate theory in 
Chapter 4. 


Features of Microeconomics 

¢ No text better explains how 
markets clear, with a 
stronger emphasis on 
supply and demand, 
relative pricing elasticity, 
and substitution effects. 

e Law and economics is 
stressed throughout; for 
example, tort law appears 
as an alternative to eco- 
nomic regulation. 


Features of Macroeconomics 
e This text is the first to 
devote the beginning 


macro chapter almost 
exclusively to growth and 
instability; growth is then 
integrated throughout the 
text. 

Extensive applications 
cover the current reces- 
sion/stagnation. 

By presenting exchange 
rates up front, Kearl is 
better able to explain the 
international macro events 
reflected daily in the news, 
such as the effects of 
German-U.S. interest rate 
differentials. 


For more information, call your D. C. Heath representative toll-free: 1-800-235-3565. 


A Raytheon Company 


College Division ¢ 125 Spring Street * Lexington, MA 02173 
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Playing by the Rules 


Games and 
Economic 
Behavior 


Editor 
Ehud Kalai 


Northwestern University 
Evanston, Illinois 


Games and Economic 
Behavior .. . 


¢ Publishes original and survey 
papers dealing with game-theoretic 
modeling in the social, biological, 
and mathematical sciences 


e Presents papers that are math- 
ematically rigorous as well as 
accessible to readers in related 


fields 


¢ Facilitates cross-fertilization 
between the theory and application 
of game-theoretic reasoning 


Volume 5 (1993), 4 issues 

In the U.S.A. and Canada: $133.00 
All other countries: $161.00 

ISSN 0899-8256 


New 


Journal of 
Housing 
Economics 


Editor 
Henry O. Pollakowski 


Joint Center for Housing Studies 
Harvard University, Cambridge 
Massachusetts 


Journal of Housing 
Economics... 


e Provides a focal point for the 
publication of economic research 
related to housing 


e Encourages papers that bring to 
bear careful analytical technique 
on important housing-related 
questions 


e Covers the broad spectrum of 
topics and approaches which 
comprise housing economics, 
including analysis of important 
public policy issues 


Volume 3 (1993), 4 issues 

In the U.S.A. and Canada: $110.00 
All other countries: $135.00 

ISSN 1051-1377 


Sample copies and privileged persona 
For more information, please write or 


ACADEMIC PRESS, INC., Journal Promotion Department 
1250 Sixth Avenue, San Diego, CA 92101, U.S.A. 
(619) 699-6742 


All prices are in U.S. dollars and are sub 
Canadian customers: Please add 7 


tes are available upon request. 


ect to change without notice. 
is and Services Tax to your order. 
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singular Europe 


ECONOMY AND POLITY OF THE EUROPEAN 
COMMUNITY AFTER 1992 


William James Adams, Editor 


CONTRIBUTORS: 

Severin Borenstein ¢ Air Travel 

Bo Carlsson * Smail Business 

Gunter Dufey * Banking 

Reinhard Ellger * Telecommunications 
John H. Jackson * World Trade 

Alexis Jacquemin ¢ Corporate Strategy 
Frédéric Jenny * Competition 

Stephan Leibfried * Social Policy 
Ernst-J. Mestmacker * Germany 
Tommaso Padoa-Schioppa * Monetary Union 
Gary R. Saxonhouse ¢ Japan 

John D. Steinbruner ¢ United States 
William Wallace ¢ Foreign Policy 
Joseph H. H. Weiler ¢ Political Union 
Susan L. Woodward » United States 


The future impact of Program 1992 upon the 
European and world economies is discussed 


by leading international experts. 
cloth $45.00 / £30.00 


Sponsored by the Chicago Council on Foreign Relations 


The University of Michigan Press 
839 Greene Street, PO Box 1104 
Ann Arbor, Michigan 48106-1104 
Plymbridge Ltd. 
Estover, Plymouth PL6 7JA UK 
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Journal of 
Economic Integration 


Solicits Papers to Compete for 
the Annual Daeyang Prize 


Volume 7, No. 1 Spring 1992 The Journal is published biannually (in Spring and Autumn) 
by the Institute for International Economics, Sejong University, 
Seoul, Korea. 


Ronald W. Jones, Factor | Journal of Economic Integration will publish articles that investigate 
Scarcity, Factor Abundance economic issues in international trade and finance. While the em- 
and Attitudes Towards | phasis is on the broad issue of international economic integration, 
Protection: The 3 x 3 Model | the Journal is interested in scholarly articles on applied and theo- 
| retical issues of international economics. Thus, the Journal intends 
to provide a forum for theorists, empirical researchers, and policy 
makers. 


Edward Tower, Domestic 
Resource Cost 


pe Aeon and | = From the papers selected for publication, the Prize committee will 


choose the best manuscript to receive the US$5,000 Daeyang 
Prize. The winner of the Prize is announced in the Spring issue 
each year. 


Intertemporal Optimization 
under Threat of VER 


ea Nee oy ® Manuscripts should be in English, typed in double spacing with 

and the Current Account | wide margins (3 cm) on one side of standard paper. Footnotes, 
figures and tables should be on separate sheets. 

Marcelo Bianconi, Monetary | 

Growth Volatility and Asset = Manuscripts should be sent in triplicate to the Managing Editor, 

Prices in a Two-Country Cash-_ | Institute for International Economics, Sejong University, Seoul, 

in-Advance Model | Korea. 


Myung-gun Choo, Provisional | = Subscription prices are US$20 per year for individuals and US$30 
PARFTA (Pacific Rim Free | per year for institutions. All issues are normally sent by surface. 
Trade Agreement) and Its | Additional USS$5 per year is required for air delivery. Checks and 
Economic Effect | money orders are payable to the Sejong University. 


Institute for International Economics 
Sejong University 
Seongdong-ku, Seoul, Korea 133-747 Tel: (02) 460-0338 Fax: (02) 460-0200 
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How Many Candles Were On 
Your Cake The Last 
Time You Thought About 
Buying Insurance? 


Flexible. 
it’s been a long 
time. Styles have Updating 


your insurance 
— doesn’t have to 
be a hassle. With our Term Life 


changed. So has 
your family, maybe 
even your job. 


And most likely, the insurance you Plan, as your needs change, so can 
bought then isn’t enough to cover a eee Ins peat through 
your family today. That's why you your association is designed to grow 
need insurance that you can easily with you—it even moves with you 
update as your life changes — AEA when you change jobs. 
Term Life Insurance. Affordable. 
Designed For You. Seabury & Smith offers AEA 
Finding insurance that’s right members the additional benefit of 
for you can be difficult and time reasonable rates, negotiated using our 
consuming. But Seabury & Smith has group purchasing power. Find out 
done that work for you. Through their © more about AEA Term Life insurance; 
research, they’ve found a Term Life call 1 800 424-9883 (in Washington, DC, 
Plan designed to help members of our (202) 457-6820) between 8:30 a.m. and 
profession provide for the future of 5:30 p.m. Eastern Time. 


their families. What’s more, you can AEA Insurance 
be sure the plan is constantly being : : 
evaluated to better meet the needs of = 


nd tomorrow. 
AEA members. Administered by Seabury & Smith. 
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MicroISP... 
the leader in Histogram of 


60 
3 
econometric software | =| afi 
MicroISP 7.0 for PC Compatibles 
The best selling econometric software is now even + 75 & 6 
better. New features include a wide range of model 3 


evaluation & hypothesis tests, recursive estimation, 
page layout features in graphics, forecast confidence 
intervals, and much more. Figure 1 


Unemployment Regressed on Dis 
MicroTSP 6.6 for the Macintosh 


This all new version was designed to make full use 
of the Apple Macintosh. View and manipulate 
multiple windows of statistical output and graph- 


ics using the standard Macintosh interface. Data z 
capacity is limited only by available memory. z 

Student Versions for PC $ 
and Macintosh : 
These limited data capacity versions include 
all of the econometric and graphic techniques is 
of MicroTSP 6.5. Price is comparable to the vat 5 iterations 

1.5 4.1 “1.911 

All Versions of MicroI'SP 
Menu, command, or batch operation ¢ linear 
and nonlinear estimation of equations & systems 
¢ Box-Jenkins ARMAX exponential smoothing 
logit probit ¢ PDLs VAR forecasting model 
simulation handling of missing data extensive of 
high resolution graphics ¢ data transfer ¢ and Act, am 
much more. a] 


Write or call for more information and a free 
demonstration disk. Academic and quantity 
discounts are available. 


MICRO 


TSB. 


Quantitative Micro Software 
4521 Campus Drive, #336, Irvine, CA 92715 , 
(714) 856-3368 © FAX (714) 856-2044 2 
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Take a look... see for 
yourself. This 
teaching tool will 
help you unlock your 
students’ potential. 
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Bavid Borkewsky 
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New York, NY 10822 
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THE CHOICE 
IS CLEAR... 
Microeconomics, 2/e 
has met the needs | 
of an extremely | 
diverse prow of 
instructors and 
students acress the 
country This dynamic 
and engaging text | 
makes the study of | 
microeconomics 
pertinent and exciting 
for students, 
for more information, please 
| 


Announcing a New Handbook for 1993........ 


Handbook of Natural 
Resource and Energy 
Economics 


Volume Ill 


edited by A.V. Kneese and J.L. Sweeney 


Handbooks in 
Economics, 6 


The Handbook of Natural 
Resource and Energy 
Economics examines the 
current theory, and 
samples current 
application methods for 
natural resource and 
energy economics. This 
third volume of the 
Handbook deals primarily 
with non-renewable 
resources. It analyzes the 
economics of energy and 
minerals, and includes 
chapters on the 
economics of 
environmental policy. 


The Handbook provides a 
definitive source, 
reference, and teaching 
supplement for use by 
professional researchers 
and advanced graduate 
Students. The surveys 
summarize not only 
received results but also 
newer developments from 
recent journal articles and 
discussion papers. 


Please mention Tue 


Contents: Some Basic 
Concepts. Economics of 
Depletable Resources: An 
Introduction (J. Sweeney). 
The Optimal Use of 
Exhaustible Resources 
(G.M. Heal). intertemporal 
Consistency Issues in 
Depletable Resources (L. 
Karp, D. Newbery). 
Analytical Tools. Buying 
Energy and 
Nonfuel-Minerals: Final, 
Derived, and Speculation 
Demand (M. Slade, C. 
Kolstadt, R. Weiner). 
Mineral Resource Stocks 


and Information (D. Harris). 


Strategies for Modeling 
Exhaustible Resource 
Supply (D. Epple, J. 


NORTH-HOLLAND 


An Imprint of Elsevier 
Science Publishers 
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Londregan). Applications 
to Policy and Forecasting 
issues. Natural Resources 
in an Age of Substitutability 
(P. Dasgupta). Natural 
Resource Cartels (D. 
Teece, D. Sunding, E. 
Mosakowski). The 
Economics of Energy 
Security: Theory, 
Evidence, and Policy (M. 
Toman). Natural Resource 
Use and the Environment 
(C. Kolstad, J. 
Krautkraemer). Energy, 
The Environment, and 
Economic Growth (D.W. 
Jorgenson, P.J. Wilcoxen) 


1993 about 500 pages 
Price: US$ 110.00 / Dfl. 250.00 
ISBN 0-444-87800-9 


The US$ price is valid in the 
USA/Canada only. In all other 
countries the Dutch Guilder ((Dfi.) 
price is definitive. 


ELSEVIER SCIENCE 
PUBLISHERS 

Attn: Heleen A.C. Gierveld 
P.O. Box 1991 

1000 BZ Amsterdam 

The Netherlands 


in the USA/Canada: 

Attn: Judy Weislogel 

P.O. Box 945 

Madison Square Station 
New York, NY 10160-0757 
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“‘T believe the key to understanding the theory is to use it.” 


--David N. Hyman, regarding his textbook 
MODERN MICROECONOMICS: Analysis and Applications 


Available Now! 


MODERN MICROECONOMICS 

Analysis and Applications 

Third Editi 
David 


When we say MODERN, we mean: 


¢ New! Modern Microeconomics in Action--self-contained 
short essays entitled International Report, The Business World, 
or Government and the Economy appear in every chapter. 
These essays provide up-to-date insights, information, and 
analysis of microeconomic issues. 


¢ Three New Chapters covering modern topics: 
Chapter 6: Choices Under Uncertainty 
Chapter 13: Business Strategies in Oligopolistic Markets 
Chapter 17: Asymmetric information and Market Performance 


¢ Coverage of many new “hot topics” including: Environmental 


Protection Policy, TQM, Advances in Technology, New Antitrust 
Cases and much more! 


“If the scale is from 0-3, Hyman's revised text deserves a 
three plus...Hyman has done a truly outstanding job in 
developing a readable and applicable text...” 


--Dr. Charles G. Leathers, The University of Alabama 


IRWIN 
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HYCELLENCE IN ECONOMICS 


WE SUPPLY WHAT YOU AND YOUR STUDENTS DEMAND. 


RYAN C. AMACHER, University of Texas at Arlington @ ALAN J. AUERBACH, 
University of Pennsylvania ¢ ScoTT BARRETT, London School of Business 
ROBERT A. BECKER, Indiana University ¢ THEODORE Bos, University of 
Alabama at Birmingham @ ELBERT V. BOWDEN, Appalachian State University 
WILLIAM J. BoYEs, Arizona State University ¢ GARDNER BROWN, JR., 
University of Washington @ WILLIAM F. CHAPELL, University of Mississippi 
REX L. CoTTLe, Wichita State University ¢ DENNIS F. ELLIs, University of 
Michigan, Flint ¢ A. MYRICK FREEMAN, Bowdoin College ¢ DAVID FRIEDMAN, 
Cornell University ¢@ THOMAS J. HAILSTONES, University of South Florida 
W. MICHAEL HANNEMAN, University of California, Berkeley @ FORREST 
HARLOow, Angelo State University ¢@ DAVID R. KAMERSCHEN, The University of 
Georgia @ MWANGI S. KIMENYI, University of Connecticut, Storrs @ MEIR KOHN, 
Dartmouth College @ LAURENCE J. KOTLIKOFF, Boston University @ FRANK V. 
MASTRIANNA, Slippery Rock University of Pennsylvania @ KENNETH E. 
MCCONNELL, University of Maryland ¢ WiLLiIAM A. MCEACHERN, University of 
Connecticut, Storrs @ PAUL. NEWBOLD, University of Illinois, Champaign-Urbana 
PAUL R. PoRTNEY, Resources For the Future @ FRANKLIN R. ROOT, The 
Wharton School, University of Pennsylvania ¢ MARTIN C. SCHNITZER, Virginia 
Polytechnic Institute and State University @ WILLIAM F. SHUGHART II, University 
of Mississippi GENE SMILEY, Marquette University V. KERRY SMITH, North 
Carolina State University ¢ DALE B. TRUETT, The University of Texas at San 
Antonio @ LiLA J. TRuETT, The University of Texas at San Antonio ¢ HOLLEY 
ULBRicH, Clemson University @ LLOYD M. VALENTINE, University of Cincinnati 


TO LEARN MORE ABOUT. CONTACT YOUR LOCAL REPRESENTATIVE OR WRITE: 


COLLEGE DIVISION South-Western Publishing Co. 


$101 Madison Road 
Cincinnati, Ohio 45227-1490 
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Addison-Wesley and the Santa Fe Institute 
announce the publication of the first book on 
the key topic for all finance economists... 


The Double Auction Market 
Institutions, Theories, and Evidence 


Daniel Friedman, University of California at Santa Cruz 
John Rust, University of Wisconsin 


"With computer networks loosening the grip of traditional intermediary car- 
tels on market structure, the Friedman-Rust compendium takes on consider- 
able significance. Academic theories tested heretofore only in the laboratory 
are no longer merely academic; one or more of the models analyzed in this 
book will undoubtedly serve as the basis for successful real markets in the 
future." 


-Steve Wunsch, President and CEO, AZX Inc. 


"Popular images of frenzied bidding in double-auction markets obscure their 
smooth operation in aggregating dispersed information and allocating trades 
efficiently. The task appears complex but experiments confirm that a few 
competing traders, each with limited information and computational ability, 
can approximate a perfect market. Unraveling how markets really work 
nowadays involves the implementation of competing designs for computer- 
ized markets. This volume's comprehensive up-to-date semmary of what's 
known sets an agenda for practical studies of market organization." 

-Robert B. Wilson, Stanford University 


With seminal papers by: 


Colin Camerer Charles Plott 
Dan Friedman John Rust 
John Kagel Vernon Smith 
John Ledyard Robert Wilson 


The Double Auction: Institutions, Theories, and Evidence is available wherev- 


er fine professional books are sold or by calling Addison-Wesley at 1-800- 
447-2226. 


ISBN 0201622637 Hardcover 419 pages $47.50 
Santa Fe Institute ISBN 0-201-62459-1 Paperback 419 pages $24.75 9 
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A TRULY OPEN ECONOMY MACROECONOMICS TEXT. 


SACHS/LARRAIN 


MACROECONOMICS IN THE GLOBAL ECONOMY 


Jeffrey D. Sachs, Harvard University, and Felipe B. Larrain, Pontifica Universidad 
Catolica de Chile 


MACROECONOMIC THEORY IN ACTION. 
IN POLAND. IN RUSSIA. 
IN BOLIVIA. IN THE WORLD. 
TAKE A LOOK. 


CONOMICS: 


PR HALL 


For an examination copy, please contact your local Prentice Hall representative, or write to 
Prentice Hall, College Operations, Englewood Cliffs, NJ 07632. 
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“NANYANG TECHNOLOGICAL UNIVERSITY 


SCHOOL OF ACCOUNTANCY AND BUSINESS 


Faculty Appointments In Economics and Statistics 


The Nanyang Technological University is a comprehensive university which offers courses at degree and 
postgraduate levels. It has 33 academic divisions organised within the present 10 schools. There are some 
864 faculty and research staff and the current student enrolment stands at 10,890, including 490 
postgraduate students. 


The School of Accountancy and Business offers undergraduate degrees in accountancy, and business, 
graduate degrees by research, as well as MBA degrees. Among its undergraduate business programmes 
is a programme specializing in Applied Economics. 

Applications are invited for faculty positions in Economics and Statistics in the School. Applicants are 


expected to possess a relevant higher degree, preferably at doctoral level, with expertise in the following 
areas of specialization: 


Economics 

° Economic Theory (Micro and Macro) * Social Accounting and Economic Development 
* Mathematical Economics e Labour Economics 

Econometrics Industrial Economics 

¢ Money and Banking/Monetary Economics ° Financial Economics 

° Public Finance e Business Forecasting 

° International Economics e Urban and Transport Economics 


Statistics 
* Stochastic Processes ¢ Advanced Quantitative Analysis 


Sound teaching, professional and research experience are also criteria in recruitment. The successful 
candidate is also expected to undertake curriculum development work, actively worked at establishing links 
with the industry within and outside Singapore, and implement management development programmes. 
Gross annual emoluments range as follows: 


Professor : $$108,870 - S$146,970 
Associate Professor :S$ 88,650 - S$122,870 
Senior Lecturer :S$ 58,680 - S$$100,310 
Lecturer :S$ 50,390 -S$ 64,200 


(US$1 - S$1.62 approximately) 


The commencing salary will depend on the candidate's qualifications, experience and the level of 
appointment offered. 


In addition to the gross annual emoluments, the University adopts the Government's practice in the payment 
of annual variable component/allowance, the quantum of which is tied to national economic performance 
and has, in past years, ranged from 1 to 3 months’ salary. Other benefits, depending on the type of contract 
offered, include provident fund benefits or an end-of-contract gratuity, nage Bo allowance, subsidised 
housing, children’s education allowance, passage assistance, and baggage allowance for transportation 
of personal effects to Singapore. Leave and medical benefits will also be provided. Staff members may 
undertake consultation work of a specialist nature, subject to the approval of the University, and retain 
consultation fees up to a maximum of 60% of their gross annual emoluments in a calender year. 


Applicants should send their curriculum vitae, including the names and addresses of three referees to: 


Director of Personnel 
NANYANG TECHNOLOGICAL UNIVERSITY 
Nanyang Avenue 
Singapore 2263 
Telefax: (65) 7919340 
Internet: Kelly@admin.ntu.ac. sg 
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PRENTICE HALL ECONOMICS 1993 


Thompson/Formby 
Economics of the Firm, Sixth Edition 
Arthur A. Thompson Jr. and John P. Formby, 
both of The University of Alabama 


Bails/Peppers 
Business Fluctuations: Forecasting Techniques and Applications, 
Second Edition 
Dale G. Bails, Memphis State University, and Larry C. Peppers, 
Washington and Lee University 


Just/Hueth/Schmitz 


Economics and Public Policy, Second Edition 


Richard E. Just and Darrell L. Hueth, both of the University of Maryland, 


and Andrew Schmitz, University of California—Berkeley 


Duetsch 
Industry Studies 
Larry L. Duetsch, University of Wisconsin—Parkside 


Heilbroner 
The Making of Economic Society, Ninth Edition 
Robert L. Heilbroner 


Blau/Ferber 
Economics of Women, Men and Work, Second Edition 
Francine D. Blau and Marianne A. Ferber, 
both of University of Illinois at Urbana-Champaign 


CONOMICS: 


PR E HALL 


For an examination copy, please contact your local Prentice Hall representative, or write to 
Prentice Hall, College Operations, Englewood Cliffs, NJ 07632. 
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Oxford 


Understanding Consumption 
ANGUS DEATON 

“The book does an impressive job of 
integrating the insights available from many 
sources, drawing on theory as well as 
econometric evidence, and micro as well as 
macro models. Understanding Consumption 
is lucid, intuitive, and eminently read- 
able.”—Marjorie Flavin, University of 
Virginia. 

(Clarendon Lectures in Economics) 

1992 256 pp. paper $18.95 cloth $35.00 


Understanding the Market 
Economy 

ARNE JON ISACHSEN, CARL B. HAMILTON 
and THORVALDUR GYLFASON 

This book offers a comprehensive analysis of 
the workings of a planned economy versus 
that of a market economy. It expounds the 
principles of market economics in the 
context of the main problems facing econo- 
mies in transition from a centrally planned 
to a market structure. 

1992 264 pp.; illus. paper $18.95 cloth $45.00 


New in paperback! 

Dominating Knowledge 
Development, Culture, and Resistance 
Edited by FREDERIQUE APFFEL 
MARGLIN and STEPHEN A. MARGLIN 
These essays examine the role of Western 
technology in development, and its effect on 
traditional cultural values and social 
practices. 

(WIDER Studies in Development Economics) 
1990 (paper 1993) 312 pp. 

paper $24.95 cloth $49.95 


Second Thoughts 

Myths and Morals of 

US Economic History 

Edited by DONALD N. McCLOSKEY 

“The caliber of the reasoning and the 
presentation of the historical evidence will 
place this book at the center of economic 
policy debates. It will be read by those who 
agree with the argument, and more impor- 
tant, by those who expect to disagree with it. 
Second Thoughts is an excellent volume to 
be used by students as well as by general 
readers.”—Stanley R. Engerman, University 
of Rochester; coauthor of Time on the Cross. 
1993 224 pp. $24.95 


New in paperback! 

Theorists of Economic 
Growth from David Hume to 
the Present 

With a Perspective on the Next Century 


“This is the rich 
and illuminating 
work we expect 
from the pen of 
Professor Rostow, 
as one of the 
persons most 
knowledgeable 
about both 
economic growth 
and the history of 
economic ideas. 
No one who 
works in these 
fields can afford 
to overlook this 
book.”—William J. Baumol, Director, C.V. 
Starr Center for Applied Economics. 

1990 (paper 1992) 736 pp.; illus. 

paper $19.95 cloth $49.95 


Fatal Tradeoffs 

Public and Private Responsibilities 
for Risk 

W. KIP VISCUSI 

“This is an important contribution to our 
understanding of the economics and social 
policy of controlling the risks to life. Viscusi 
has done outstanding work in this field, and 
much of it is represented here. The treat- 
ment is insightful and comprehensive. Fatal 
Tradeoffs is essential reading for anyone 
who wants the most current information and 
assessment of this important subject.” 
—Sherwin Rosen, University of Chicago. 
1992 320 pp.; illus. $37.00 


Prices are subject to 
change and apply only in the U.S. 
To order, send check or money order to: 
Social Sciences Marketing, Dept. PK. 
To order by phone using major credit cards 
please call 1-800-451-7556. 


Oxford University Press 
200 Madison Avenue ¢ New York, NY 10016 
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ECONOMICS, COORDINATION AND CONTROL 


McCORMICK 


MANAGERIAL ECONOMICS 
Robert E. McCormick, Clemson University 


ROBERT E. 
MCCORMICK 


MANAGERIAL 
ECONOMICS 


PROPERTY RIGHTS. INFORMATION COSTS. 
VERTICAL INTEGRATION. LOTS MORE. 
MODERN STUFF. [TAKE A LOOK. 


F\CONOMICs: 


PREM HALL 


For an examination copy, please contact your local Prentice Hall representative, or write to 
Prentice Hall, College Operations, Englewood Cliffs, NJ 07632. 
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Economically Speaking 


Making a Market 
The Institutional Transformation 
of an African Society 


Jean Ensminger 

In this book, Jean Ensminger analyzes the proc- 
ess by which the market was introduced into 
the economy of a group of Kenyan pastoralists. 
She employs neoinstitutional economic analy- 
sis in order to assess the impact of new market 
institutions on production and distribution, 
with particular emphasis on the effect of insti- 
tutions on decreasing transaction costs over 
time. She is able to trace the effects of increas- 
ing commercialization on the economic well- 
being of individual households, rich and poor 
alike and to analyze the process by which insti- 
tutions themselves are transformed as a market 
economy develops. 

Political Economy of Institutions 

and Decisions 

42060-1 Hardcover $49.95 


The Methodology of Economics 
Or How Economists Explain 

Second Edition 

Mark Blaug 


“This is a well-written and stimulating book... 
The summaries of the relevant literature are 
very good; its critical comments are balanced 
and fair. Many economists will find it a useful 
book.” —Journal of Economic Literature 
**...If one is looking for a single book that 
might safely be put into the hands of eco- 
nomics students to inform them about relevant 
developments in the philosophy of science, 
about the tradition of self-conscious appraisal 
in their own discipline, and about the empiri- 
cal substance of what they will almost every- 
where be taught as mainstream economics, 
then this is it.” 

—Neil De Marchi, The Economic Journal 
43061-5 Hardcover $54.95 /43678-8 Paper $17.95 


Institutions and Social Conflict 
Jack Knight 

Many of the fundamental questions in social 
science entail an examination of the role 
played by social institutions. Why do we have 
so many social institutions? Why do they take 
one form in one society and quite different 
ones in others? In what ways do these institu- 
tions originally develop? And when and why 
do they change? This book addresses these 
questions. 

Political Economy of Institutions 

and Decisions 

42052-0 Hardcover $54.95 /42189-6 Paper $16.95 


Economic Transformation 

in Eastern Europe and the 
Distribution of Income 

A. B. Atkinson and M. Micklewright 


Who gains and who loses from economic 
transformation in Eastern Europe is a key ques- 
tion—but one that is too rarely understood. 
This book assembles evidence about earnings, 
dispersion, income inequality and poverty in 
Czechoslovakia, Hungary, Poland, The USSR, 
Russia, The Ukraine, etc. 

43329-0 Hardcovr $64.65 / 43882-9 Paper $18.95 


Theories of Political Economy 
James A. Caporaso and 


David P. Levine 

““Caporaso and Levine skillfully map out the ter- 
rain of political economy....The consideration 
of policy issues nicely illustrates and comple- 
ments the more abstract points. Theories of 
Political Economy combines the virtues of both 
monograph and text and will edify scholars 

and students of political economy and policy 
studies.” —David F Weiman, Yale University 
This exploration of some of the more impor- 
tant frameworks used for understanding the 
relationship between politics and economics 
includes the classical, Marxian, Keynesian, 
neoclassical, state-centered, power-centered, 
and justice-centered. 

41561-6 Hardcover $49.95 /42578-6 Paper $16.95 


Advances in Economic Theory: 
Sixth World Congress 


Jean-Jacques Laffont, Editor 

These books comprise papers examining the 
latest developments in economic theory given 
at the Sixth World Congress of the Economet- 
ric Society in Barcelona in August 1990. This is 
the latest in a series of collections that cover 
the most active fields in economic theory over 
a five year period. These books give the reader 
a unique survey of the most recent advances in 
economic theory. 


Volume 1 

Contributors: Ken Burmore, Eric Van Damme, 
Drew Fudenburg, Eddie Dekel, David Pearce, John 
Moore, Ariel Rubenstein, Mathias Dewatripont, 
Thomas Palfrey 

Econometric Society Monograpbs 20 
41666-3 Hardcover $54.95 


Volume 2 

Contributors: Lawrence Epstein, Milton Harris, 
Arthur Raviv, Jean Tirole, Patrick Bolton, Darrell 
Duffice, David Cass, Michael Woodford, Roger 
Guesnerie, Lawrence Jones, Andreu Mas-Colell 
Econometric Society Monographs 21 
43019-4 Hardcover $54.95 
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Economically Speaking 


Capital and Credit 
A New Formulation of 
General Equilibrium Theory 


Michio Morishima 

Professor Morishima provides a much-needed 
synthesis of growth theory and monetary 
theory, drawing on the work of Schumpeter, 
Keynes and the prewar neoclassical economists 
to formulate a capital-theoretic general equilib- 
rium theory. 

41840-2 Hardcover $49.95 


Public Choice Analysis in 
Historical Perspective 

Alan Peacock, Editor 

Alan Peacock poses the question of whether 
the history of economic thought is an essential 
part of the training of scholars concerned with 
the study of public finance. He contends that 
the perspective gained by studying the origins 
of public choice analysis can offer an impor- 
tant stimulus to scientific progress. 

43007-0 Hardcover $44.95 


The Economics of Earnings 
Solomon W. Polachek and 
W. Stanley Siebert 


In examining the wages people earn, the jobs 
they do, and the effect of government labor 
market policies on wages and employment, 
this work synthesizes research findings using a 
human capital framework that no other similar 
book offers. 

36476-0 Hardcover $59.95 /36728-X Paper $19.95 


Japan's Capitalism: 

Creative Defeat and Beyond 
Shigeto Tsuru 

Foreword by John Kenneth Galbraith 
Japan’s defeat in the Second World War was 
total, but her subsequent reconstruction was 
extraordinary. In this book one of Japan’s most 
eminent economists gives a comprehensive 
account of the recovery process, and a unique 
interpretation of the postwar Japanese econ- 
omy, both as political economist, and as gov- 
ernment advisor over most of the period. He 
analyzes the welfare significance of Japan’s 
money-oriented affluence, and the emergence 
of a distinctive corporate capitalism. His 
conclusion is that Japan’s inspired creative 
response to defeat has itself led to a new set 
of intractable problems. 

Cambridge Studies in Economic Policies 
and Institutions 

36058-7 Hardcover $27.95 


incomparable Worth 
Pay Equity Meets the Market 
Steven E. Rhoads 


Challenging proponents of equal pay for 
comparable worth, Steven Rhoads argues 
that implementation has been plagued by 
critical and insurmountable problems. 
Where success has beer most frequently 
touted—in Minnesota, England, and 
Australia—job evaluation results are 
often arbitrary and political rather than 
objective. Rhoads cites cases where 
workers have been rated higher than 
their bosses, or where others previously 
paid equally for doing equal work have 
been assigned unequal pay. These and 
other bizarre outcomes have created 
severe problems for public and private 
managers. But the most significant costs 
of comparable worth go far beyond the 
administrative. By substituting admin- 
istered wages for market-determined 
ones, it produces gross inefficiency 

in labor markets. 

44187-0 Hardcover $24.95 


Also availabie by Steven Rhoads 


The Economist's View 
of the World: Government, 


Markets and Public Policy 


30160-2 Hardcover $49.50 
31764-9 Paper $17.95 


Price, Quality and Trust 

inter-firm Relations in Britain and Japan 
Mari Sako 

Long-term, close trading relations based on 
trust have come to be regarded as a key to the 
management of buyer-supplier relations in 
industry, while the arm’s length and adversarial 
commercial bargain has fallen out of favor. This 
book examines these contrasting approaches, 
and the link between them and corporate 
performance, through a study of the electron- 
ics industry in Britain and Japan, and offers 

a unique integration of multi-disciplinary 
theorizing and original comparative 

empirical work. 

Cambridge Studies in Management 18 
41386-9 Hardcover $49.95 
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Now in paperback... 


Leading Economic Indicators 
New Approaches and Forecasting Records 
Kajal Lahiri and Geoffrey H. Moore, 
Editors 

In recent years, significant new developments 
have emerged in the field of business cycles 
and economic indicators. This volume con- 
tains twenty-two articles by international 
experts who are working with new and 
innovative approaches. 

Contributors: Kajal Lahiri, Geoffrey Moore, Frank 
de Leeuw, Salib N. Neftci, James H. Stock, Mark W. 
Watson, Michael P. Niemira, Roy H. Webb, Arnold 
Zeliner, Chansik Hong, Stephen K. McNees, Hermon 
O. Stekler, Stephen J. Silver, Edward F. Renshaw, 
Allan P. Layton, Francis X. Diebold, Glenn D. 
Rudebusch, Victor Zarnowitz, Howard L. Roth, 
James M. Boughton, William H. Branson, Susmita 
Dasgupta, Ernst A. Boehm, Philip A. Klein, 

Feliks Tamm 

43858-6 Paper $22.95 


Uncertainty 

A Guide to Dealing with Uncertainty in 
Quantitative Risk and Policy Analysis 

M. Granger Morgan and 

Max Henrion 

“Uncertainty is a risk analysis textbook that 
Granger Morgan and Max Henrion have writ- 
ten to serve as a basic text for students as 
well as a reference work for practitioners and 
researchers. Uncertainty succeeds both in 
providing an excellent basis for policy analysis 
and in drawing upon a variety of case studies 
that will be of interest to risk researchers.” 


—Journal of Economic Literature 
42744-4 Paper $16.95 


The Market and Beyond 
Information Technology in Japan 
Martin Fransman 
“Martin Fransman has written a valuable book. 
This book should be important to, and studied 
and heeded by anyone with an interest or stake 
in the information technology field.” 
—American Journal of Economics 
and Sociology 
““Fransman’s analysis shows just how complex 
the Japanese system is. He also shows how cau- 
tious Japanese firms have been in entering into 
cooperative research agreements.” 


—Journal of Economic Issues 
43525-0 Paper $19.95 


Economically Speaking 


The Econometric Analysis of 
Transition Data 


Tony Lancaster 
“The analysis of transition/duration data is the 
next frontier beyond the now well-charted area 
of qualitative choice and limited-dependent 
variables. Lancaster’s book is an excellent 
travel guide to this relatively unknown but 
beautiful land where time-series analysis and 
qualitative choice merge. Even if they do 
not decide to stay, economists and social 
scientists will return enriched from a trip 
under Professor Lancaster’s guidance.” 

— Geert Ridder, University of Groningen 
Econometric Society Monographs 17 
43789-X Paper $17.95 


Beyond the Miracle 

of the Market 

The Political Economy of Agrarian 
Development in Kenya 

Robert H. Bates 


“The book succeeds in an empirically 
grounded synthesis of political and economic 
concepts—a synthesis that will challenge the 
thinking of academics and practitioners alike.” 
—American Anthropologist 
Emphasizing the political and economic 
origins of institutions and their role in eco- 
nomic development, this book examines the 
political economy of Kenya from the Mau Mau 
rebellion of the 1950s to the tragic droughts of 
the 1980s. 
Political Economy of Institutions 
and Decisions 
43792-X Paper $17.95 


Two-Sided Matching 
A Study in Game-Theoretic Modeling 
and Analysis 


Alvin E. Roth and 


Marilda A. Oliveira Sotomayor 
Two-sided matching provides a model of 
search processes such as those between firms 
and workers in labor markets or between buy- 
ers and sellers in auctions. This text provides 
a comprehensive account of recent results 
concerning the game-theoretic analysis of 
two-sided matching. 

Econometric Society Monograpbs 18 

43788-1 Paper $17.95 
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Economically Speaking 


The Elements of Accounting: 
An Introduction 
Geoffrey Whittington 


Assuming no previous knowledge or experi- 
ence in accounting, this concise introduction 
to the content and structure of accounting 
statements and the double entry system 
emphasizes fundamental principles rather 
than detailed practical procedures. 

41392-3 Hardcover $49.95 /42449-6 Paper $17.95 


Hungary: 

An Economy in Transition 
Istvan P. Székely and 

David M. G. Newbery, Editors 


The economic transformation of Hungary has 
reached a critical stage. This book presents 
some of the local arguments and perceptions 
informing the current debate, and critical 
examination of these ideas by an international 
panel of scholars. Chapters address the privati- 
sation program; reform of financial, tax and 
legal systems; integration into the international 
financial and monetary systems; labor markets, 
unemployment and the social safety net; and 
the political economy of the current economic 
transformation. 

Contributors: Istvan P. Székely, David M. G. 
Newbery, Laszl6 Csaba, Kalman Mizsei, Laszlo 
Halpern, Renzo Daviddi, Zsigmond Jarai, Péter 
Mibdlyi, Janos Standler, John P. Bonin, Rumen 
Dobrinsky, Paul Seabright, Julia Kiraly, Eva 
Varhegyi, Istvan Abel, Lucian Ionescu, Colin Mayer, 
Gabor Oblath, Werner Riecke, L. Alan Winters, 
Richard Portes, Tamas Sarkézy, Jeno Koltay, Athar 
Hussain, Janos Kollo, Maria Augusztinovics, 
David Winter, David Begg, R. E. Rowthorn, 

Sir Adam Ridley 

44018-1 Hardcover $59.95 


Finance and Development: 
issues and Experiences 


Alberto Giovannini, Editor 

This volume from the Centre for Economic 
Policy Research presents the latest theoretical 
and empirical research by leading experts on 
the relationship between financial market 
structure and economic growth in developed 
and developing countries. The issues discussed 
are of key importance to the future of eco- 
nomic integration in Western Europe, and the 
current economic transformation of Eastern 
Europe, and the future of lending to high-debt 
developing countries. 

Contributors: Alberto Giovannini, Paul Krugman, 
Oren Sussman, Bruce C. Greenwald, Alec Levinson, 
Joseph E. Stiglitz, Daniel Coben, Yung Chul Park, 
Riccardo Faini, Giampaolo Galli, Curzio Giannini, 
Ronald I. McKinnon, Juan Ramon Cuadrado, 
Guillermo de le Dehesa, Andrés Precedo, Philippe 
Aghion, Robin Burgess 

44017-3 Hardcover $54.95 


Trade Flows and 
Trade Policy after 1992 
L. Alan Winters, Editor 


This volume offers 2 major re-appraisal of the 
effect of completing the European Commu- 
nity’s single-market programme on trade 
among EC members states and also on the 
Community’s external trade. Individual chap- 
ters assess the effects of 1992 on competition 
if transport costs and taste differences remain; 
detailed case-studies show that integration has 
complete effects on trade flows and prices and 
stress the importance of devising a satisfactory 
international trade policy for the completed 
market; other chapters estimate the elasticities 
of demand for imports. 

Contributors: L. Alan Winters, Michael Gasiorek, 
Alasdair Smith, Anthony J. Venables, Jan I. 
Haaland, Victor D. Norman, Cillian Ryan, 

Ian Wooton, Gernot Kleper, Larry S. Karp, 

Jeffrey M. Perloff, Joaquim Oliveira-Martins, 

J6el Toujas-Bernate, Paul A. Brenton 

44020-3 Hardcover $54.95 


Available in bookstores or from 


CAMBRIDGE 


UNIVERSITY PRESS 


40 W. 20th St., N.Y., NY 10011-4211 
Call toll-free 800-872-7423 
MasterCard/VISA accepted. 

Prices subject to change. 
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The Foremost Economic 
Literature Database! 
Published by the 
American Economic Association 


Online 


Economic Literature Index 
File 139 on DIALOG® 


In-depth coverage of journal literature: citations from the Journal of Economic 
Literature since 1969, with abstracts since 1984. Also citations of articles in 
collective volumes since 1979. 


On Disc 


EconLit 
CD-ROM by SilverPlatter® 


Over two decades of citations and abstracts from the Journal of Economic 
Literature on a single disc! Includes abstracts of journal articles and books, 
citations of articles in collective volumes, and disseriation titles. 


In Print 


index of Economic Articles in Journals and Collective Volumes 


Over 100 years of economic literature indexed by publication year: 1886 
through 1988 in 30 volumes on your reference shelf! Make sure your set 
is up-to-date. 


For information, write or call: 


Journal of Economic Literature, RO. Box 7320, Pittsburgh, PA 
15213-0320 (412) 268-3869 
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Now available for individuals! 


AEA 
EconLit Members 
Save $50 


A SilverPlatter® CD-ROM published by 
The American Economic Association 


The established, reliable database for literature searches in economics derived from the 
JOURNAL OF ECONOMIC LITERATURE! 


Hardware Requirements 


IBM PC or compatible Apple Macintosh 

640 of RAM MacPlus, SE, or II series 
10 MB hard disk drive Two MB RAM 

PC DOS or MS DOS A hard disk drive 

ver. 3.1 or higher System 6.02 or higher 


MS DOS CD-ROM Extensions 2.0 or higher 
ISSO-compatible CD-ROM drive 


ORDER INFORMATION — Please check appropriate boxes: 

[_] Current disc (please allow 2-3 weeks for delivery of current disc) or 

[_] Next available disc (updated disc is available 2-3 months after JEL issue) 
IBM/MS-DOS 5 1/4 or 31/2 or [_] Apple Macintosh 


AEA MEMBERS: NON-MEMBERS: 
[-] $300* U.S. & Canada ($330 elsewhere) [ ] $350* U.S. & Canada ($385 elsewhere) 
Enclosed is my check for $ Charge $ to: 
payable to the [_]MasterCard or [_]VISA 
American Economic Association Account # 
(checks must be drawn on a U.S. bank, payable in U.S. dollars) 
Exp. Date 
(Name a appears on account) 
Signature 
Name 
Address 
City State/Province Zip 
Country Telephone ( ) 
*License restricted to personal use. Also includes SilverPlatter Information Retrieval System soft for EconLit. Prepaid 


orders only. CD may be purchased once a year only. No free trials. Postage included in the price. Library and department 
rates available on request. 


Mail to: American Economic Association or Call: PH: 412-268-3869 
Journal of Economic Literature - Dept. AE FAX: 412-268-6810 
P.O. Box 7320, Pittsburgh, PA 15213-0320 
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HECK’s ECONOMIC 
LITERATURE DATABASE 


For the Personal Computer 


EXPANDED VERSION NOW AVAILABLE! ONLY $495! 


That's right! Only $495 per year for a site license for the 
most user-friendly, comprehensive and lightening fast 
literature database available. And our site license allows for 
networking or use on any pc's throughout your campus. Plus, 
the newest version includes more than 200 leading journals. 


There isn't another literature database anywhere that even 
comes close. Your institution could easily spend over $2000 
per year for the competition. But why? No CD ROM drives 
needed here! With a few simple commands on your pe, all 
the relevant references can be retrieved by any keyword. 


* Includes over 125,000 bibliographic references from more 

than 200 leading economic journals from 1950 to present 

* Search command compiles listing by author, journal or any 
keyword contained in an article title 

* Narrow command reduces selected list by keyword, giving 
"and/or" capabilities 

* Browse command permits scrolling of index when keyword 
or its spelling is unknown 

* Print/Save command outputs selected references to your 
printer or an ASCII file 

* Typical search time of entire database by keyword is usually 
less than one second 

* Runs on IBM/compatibles with 16 MBytes of disk storage 

* Site license allows for use on all campus PCs or networking 

* Site license fee only $495 per year with annual updates 


To Order, Or For More Information Contact: 


Jean Louis Heck 
JLH Enterprises 
320 West Wayne Ave. 
Wayne, PA 19087 
Phone/FAX: 215-989-0776 
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Review of 


INTERNATIONAL 
ECONOMICS RTE 


Review of International Economics (RIE) is a new journal devoted to the publication 
of high-quality articles on topics in international economics: trade theory, 
commercial policy, economic integration, empirical research (e.g., agricultural 
trade), international finance, open economy macro issues, trade problems of 
developing and transition economies, GAIT negotiations, etc. Open to controversial 
and innovative papers. Contributions from related fields such as development and 
environmental economics are also welcome. Whether theoretical, empirical or 
policy-oriented, relevance to real world problems is a paramount concern. RIE is 
produced three times a year by Blackwell Publishers. 


Contributors (November 1992 issue): Paul Samuelson, Ronald Jones, Ravi Batra, 
Maurice Obstfeld, Stephen Turnovsky, Marcello Bianconi, and Alex McCalla. 


Best Article Awards: RIE will be offering two prizes each year: Best International 
Economics Article Award ($7,000), and Outstanding International Economics 
Article Award ($2,000: only assistant professors are eligible). 


Associate Editors: Hamid Beladi, Gene Grossman, Paul Krugman, Harvey Lapan; Book Review Editor: 
Francois Casas. Editorial Board: Ravi Batra, Wilfred Ethier, Jeffrey Frankel, Ronald Jones, 

Giancarlo Gandolfo, Murray Kemp, Stephen Magee, Sugata Marjit, Alex McCalla, Maurice Obstfeld, 
Michihiro Ohyama, Assaf Razin, Roy Ruffin, Andrew Schmitz, Bruno Solnik, Barbara Spencer, 

René Stulz, Akira Takayama, Edward Tower, Stephen Turnovsky, Alan Winters. 


Manuscripts (4 copies) should be sent to: E. Kwan Choi, Editor, Review of 
International Economics, Dept. of Economics, Iowa State University, Ames, IA 
50011. (515) 294-5999; Fax: (515) 294-6336. Submission fee: assistant professor, 
$20; others, $30; none for authors in developing countries). 


Subscription Rate: Join the International Economics and Finance 
Society (IEFS) for a reduced rate. Please send payment ($45 for IEFS 
membership with subscription to RIE, or $10 for membership only) to 
James Hartigan, Department of Economics, University of Oklahoma, 
Norman, OK 73019. (405) 325-5501. (For more information about 
IEFS, please contact James Hartigan or E. Kwan Choi.) 


The general subscription rate is $45 for individuals in N. America and 
£35 elsewhere. Send payment to: Paula Stewart, Journals Marketing, 
Blackwell Publishers, 108 Cowley Road, Oxford OX4 1JF. U.K. 
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The Experts Agree 


“| purchased GAUSS for my PC three years ago. ...After three days exploring the 
package, | was hooked. | became an avid GAUSS user, and remain so today...” 


EJ. Bedrick, Statistics and Computing 


“GAUSS is remarkable in its facilities and capabilities. Indeed in the experience of this 
reviewer the package is unprecedented in its scope, flexibility and power.” 


Peter J. Dolton, Journal of Economic Surveys 


“love the new debugging facility and the new graphics drivers. | am now using a 
IMB Super VGA and it's great to have full resolution available on it. |'m 
using a SOMHz 486 and the speed of GAUSS is just amazing...” 


David Boird, Biometrician, MAF Technology 


“The speediest and most versatile of the mathematical programming languages.” 
“It offers both quick graphics and magnificent presentation-quality graphics.” 


“GAUSS is a number cruncher par excellence.” 


Maximum Productivity 

If you work with numbers, the GAUSS programming lan- 
guage will make you more productive, giving you more 
answers to more questions, more quickly . . . guaranteed! 


Whatever language you are now using, you'll find that GAUSS 
greatly shortens development time. With GAUSS the basic unit 
of analysis is a matrix, resulting in a syntax that closely resem- 
bles common mathematical expressions. Since matrix opera- 
tions are assumed, most of the loops required by other lan- 
guages are not necessary in GAUSS. Over 400 functions 
including LINPACK and EISPACK routines are provided. 


GAUSS for Unix 

Now the power of GAUSS is available in the Unix environ- 
ment. Versions for IBM RS/6000 and Sun SPARC are currently 
shipping. Call for details and pricing. 


GAUSS Light” 

If you are looking for an economical yet powerful mathemati- 
cal tool, GAUSS Light - the fully compatible entry-level ver- 
sion of GAUSS may be just what you need. At just $150 for 
DOS and $295 for Unix, this is an exceptional value. 


Aptech Systems;4ne. (20 
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Barry Simon, PC Magazine 


GAUSS-386i 3.0 


Fast and flexible, version 3.0 incorporates many more func- 
tions and an intuitive user interface. 


Some of the latest features include: Full 32-bit Addressing, 
automatic handling of Complex Numbers, support for Virtual 
Memory (up to 4 GB), a powerful Source Level Debugger, a 
Matrix Editor, flexible Data Transformation capabilities, and ver- 
satile Multiple File Viewing/Importing Windows. GAUSS's 2D 
and 3D Publication Quality Graphics support SVGA 1024x768 
video, and most popular file formats including Color 
PostScript’. 

Newly updated Application Modules eliminate programming 
steps for many tasks, while maintaining flexibility for non- 


standard problems. One of the latest of these is an implemen- 


tation of the recently introduced Linear Algebra Library - 
LAPACK. 


GAUSS-386i 3.0 is just $995 and comes with a 30-day money 
back guarantee. Discounts available for Academic use. Call or 
write for a free brochure on GAUSS Systems and Applications. 


GAUSS, GAUSS-386, GAUSS-386i, and GAUSS Light are trademarks of Aptech Systems, Inc. Other trademarks 
are the property of their respective owners. Prices and specifications subject to change without notice. 
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